FIGURE l 

GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTTT TGCACAGTGGCAAT TT TGGGACTGAT TTAGAACAGAAACTCCATTGGAACCCCGAGG 
ACAAAGGT TATG TGC T T CACATGATCAC TACT GCAGCAGAATGGTC TAT GT CAT TTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
T GGATT AACCC T CTAT GACAC TGCAC CTT GCCC TATT AACAATGAACGAACACGGCTACT T T CCA 
GAGATAT T TGA T GAAAG GAT AAAATAT T T C T GT AAT GAT TATGATTCT CAGGGAT TGGGGAAAGG 
T TCACAGAAGT TGCT TAT T CT T C TC TGAAAT T T TCAACCAC TTAAT CAAGGCT GACAGTAACACT 
GATGAATGCTGATAATCAGGAAACATGAAAGAAGCCATT TGATAGAT TATT CTAAAGGATATCAT 
CAAGAAGACTATTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACT 
ATG 



■r.r 



FIGURE 2 



<subunit 1 of 1, 2 66 aa, 1 stop 
<MW: 29766, pi; 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LC IAT I YVRYKQVHALSPEENVI IKLNKAGLVLGI LSCLGLS IVANFQKTTLFAAHVSGAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMS FSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 



Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

Iff N-myristoylation sites. 



amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE ft 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTT CGCC ATGA GT T TCCTCATCGACT CCAGCATCATGAT TACCTCCCAGATACTATTT TTT G 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
M ! CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
\f z CAT T CTCAGCCCAAAA.CAT GGGATCTTAT CCATAGAACAGCTCAT CAGC CGGGTT GGTGTGATT GGAG 

TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
Ul CTCAGGAATGTGACT GACACGGATATTCTAGCC CT GGAACGGC GACT GCTGCAAACCAT GGATAT GAT 

| f| C AT AAGCAAAAAGAAAAGGAT GG CAAT GG CAC G GAGAACAATG TT CCAGAAG GGGGAAG T G CAT AAC A 

10 AAC CAT CAGGT T T CT G G GGAAT GAT AAAAAGT G TTAC CAC T T CAG CAT C AG GAAGT GAAAAT CT T ACT 

*H CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 

"L AT AT GCTAC CAAGGAGAGAATAGAATAC T CCAAAAC C T T C AAG GG GAAATAT T T TAAT TTTCTTGGTT 

U 

j; h ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 

#4 GGGAAAACGGATCCTGT CACAAGAGGCAT TGAGATCACT GTGAAT TAT CTGGGAATCCAAT TTGATGT 



GAAGTTTTGGTC C CAACACATTT CCTTCATTCTTGT TGGAATAAT CAT CGTCACATCCATCAGAGGAT 
TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 



CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 



AGAATAC C G CAC C AT AATCACT GAAGT C C T TG GAGAACT G C AG TT CAAC T T C T AT C ACC GT T GGT T T G 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAAT GG CAC C TTGAAC TT AAGC C TAC TAC AGAC T GT TAGAGG CC AG TGGT T T CAAAAT T T A 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATT TTT CAC CT T CAT AG CAT ACT CCTTCCCCGT CAGGTGAT AC TAT GAC CAT GAGT AGCAT CAG C CAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
T GTAGAG GC GGAGAGGAGC CAAGAAACT AAAG G TGAAAAATACACTG GAAC T C TGGGGCAAGACAT GT 
CTAT GGTAGCT GAGCCAAACACGTAGGATTTCCGTT T TAAGGTTCACAT GGAAAAGGTTATAGCTTTG 
CCT T GAGAT TGAC TCAT TAAAAT CAGAGACT G TA&CAAAAAAAASAAAAAAAAAAAAGGG C GG CC GCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE 4 

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMZlLLSGFGAVNCPYTYMSYFLRNVTDTDIIiALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGI QFDVKFWS QH I S FI LVGI 1 1 VTS I RGLL I T LTKFFYAI S S SKS SNVIVLLLAQIMGMY 
FVS SVLL I RMSMPLE YRT 1 1 TEVLGE LQ FNFYHRW FDVI FLVSALS S I L FL YLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE g 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACT CTCCATCCGGAC T AGT TAT TGAGCATCTGC CT C T CATATCACCAGTGGCCAT CT GAGG T 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAAT TATGGGGATCACCCTT GTGAGCAAAAAGGCGAACCAGCAGCTGAATTTCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCGTATTGTTACAACTCATCTGATACTTGGACTAACTGGTGCATTCCAGAA 
AT TATCACCACCAAAGAT CCCATAT TCAACAC T CAAACT GCAACACAAACAACAGAAT T TAT T GT 
CAGTGACAGTACC TACT CGGT GGCATCCCC T TACT C TACAATACCTGCCCCTAC TAC TAC T CCTC 
f«j CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
Z% GAAAC TAGCACCAT GTCTACAGAAAC T GAACCAT TT GT TGAAAATAAAGCAGCAT TCAAGAAT GA 

If; AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
IP CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
|i| CAGCAGAAGGAAAT GAT C GAAAC CAAAGT AG TAAAGGAGGAGAAGGCCAAT GATAGCAACCCTAA 

• f| TGAGGAAT CAAAGAAAACT GATAAAAACCCAGAAGAGTC CAAGAGT CCAAGCAAAAC TACCGT GC 

m GATGCC T GGAAGC T GAAGT TTAGAT GAGACAGAAATGAGGAGACACACC T GAGGC TGGT T T CT T T 

CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 
5 GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
#*| T TC TAGCC T GGC TAT GT C CTAATAATAT CCCACTGGGAGAAAGGAGT T T TGCAAAGTGCAAGGAC 

CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
W GC T C T GAAAGAGAAACACGT AT CCCACCTGACATGTCCT T C TGAGCCCGG TAAGAGCAAAAGAAT 

||| GGCAGAAAAGT T TAGCCCC TGAAAGCCATGGAGAT T C TCAT AACT TGAGACCTAATCT CT GTAAA 

Q GCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACTGTCAGCAGGGACTGTAAAC 
175 ACAGACAGGGTCAAAGTGT T T TC TC T GAACACAT TGAGT TGGAATCAC TG T T TAGAACACACACA 

5 ^ C T TAC T T T T TC TGGT CTCT ACCACT GCT GAT AT T T TC TC TAGGAAAT ATACT T T TACAAGTAACA 

AAAATAAAAAC T CT TAT AAAT T TCTAT T T TT AT C TGAGT TACAGAAATGAT TACT AAGGAAGAT T 
ACT CAGT AATT TGT T TAAAAAG TAATAAAAT TCAACAAACAT T T GC TGAATAGCTAC TATATGTC 
AAGTGCT GTGCAAGGTAT TACAC TC TGTAAT TGAATAT TAT TCC TCAAAAAAT TGCACATAGTAG 
AACGC TATCTGGGAAGCTAT T T T T T T CAGT T T T GATATTT CTAGC T TAT CT ACT T CCAAAC TAAT 
TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT 
TAT TAT TAACATACCTAAGAAGTACATT GT TACC TC TATATACCAAAGCACAT T T TAAAAGT GCC 
AT TAACAAAT GTATCACT AGCCC TCCTTTT TCCAACAAGAAGGGAC TGAGAGATGCAGAAATAT T 
TGTGACAAAAAATTAAAGCATT TAGAAAACTT 
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FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVETALKASFETCSYGWVGDGFWI SRI S PNPKCGKNGVGVLIWKVPVSRQFAAYCYN 
S SDTWTNSCI PE 1 1 TTKDP I FNTQTATQTTE FIVSDSTYSVAS PYS T I PAPTTTPPAPASTS I PR 
RKKL I CVTEVFMETS TMS TETEP FVENKAAFKNEAAG FGGVPTALLVLALL FFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAAT TGCGCAGCGCGGT GGAAGAGAT GGAGGCAGAAGAAGC T GCT GCTA 
AAGCATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGAC 
AC G AAG G T T G GAAAT AAT AC CAT C CAT G T G C AC C G AG AAAT T C ACAAG AT AAC C AAC AAC C AG AC 
T GGACAAATGGT CT T T T CAGAGACAGT TATCACATC T GT GGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
C TCCACT ACCCCACACCAGCCT T GGT GCCACCAAAAGTGC T CCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
AT TGACAC TGT CCC TCTT TGGCAGT T GCAT TAGTAACT T T GAAAGGTAT AT GAC TGAGCGT AGCA 
TACAGGTT AACC TGCAGAAACAGTAC T T AGGTAAT T GTAGGGCGAGGAT TAT AAAT GAAAT T T GC 
AAAATCAC TTAGCAGCAAC T GAAGACAAT TAT CAACCACGT GGAGAAAATCAAAC CGAGCAGGGC 
TGTGTGAAACAT GGT T GTAATATGCGACT GCGAACAC T GAACTC T ACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CATGATTGTATAAGCATGCTTTCTT TGAGTT TTAAAT TATGTATAAACATAAGT TGCATTTAGAA 
AT CAAGCATAAATCACT T CAACTGCAAAAAAAAAAAAAAAAAAA&AAAAAA 
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FIGURE 8 

MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEWLANLPPSYHNETNTDTKVGNNT IHVHRE IHKI TNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPT FVGSRDQDGE I LLPREVPDE YEVGS FMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site. 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
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FIGURE o 



CGGA CGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
H CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
J:f TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
|.y GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
III ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
W GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGAT TCT GGGCAAGAAGAACGAGGCGCCC C CGGACAACAAGAAGAGGAAGAAG TGACAGCT CCGG 
Iff CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATGCAACCTCACTTCGCCTTACAGGT 
O CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
W TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
Sf AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 

GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGAT T T TAT T AAAGAT AT T T T G T T AACT C 
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FIGURE 10 



RTRGRTRGGCEKVP INT S CNP TAHLWS SCPGLMCVFQGYS SKGL I QRS VFNLQI YGVLGL FWTL 
NT^mALGQCVIAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL 
E Y I DHKLRGVQNPVARC IMCC FKCCLWCLEKFIKFLNRNAY IMI AI YGKNFCVS AKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 



Important features : 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites . 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 



#=l amino acids 276-28.8 



Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 



Amidation site. 

amino acids 30 6-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCGTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATGCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
T T C TACT T GCT G T CGAT CGC GGCCGT G G C GC T GAT GT T CAT GT AC T ACACT GAGC C CAGC G G CT GCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGC TGA GGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 

MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAG I PTVLQGH I DCGS LLGYRAVYRMCFATAAFFFFFFTLLMLCVS S SRDPRAAI Q 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLS I AAVALMFMYYTE PSGCHEGKVFI S LNLT FCVCVS IAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALS S I PEQKCNPHLPTQLGNETWAGPEGYETQWWDAPS I 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
SFFHFCLVLASLHVMMTLTNWYKPGETRKMISTWTAVWVKICMWAGLLLYLWTLVAPLLLRmD 

ff% Signal sequence: 

|y amino acids 1-20 

iy; Transmembrane domains: 

SiS 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
f\ 272-283, 324-340, 391-406, 428-444 

■■■ 

W% 

CJ 

% , « 

■<R*je ! 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT T GGAAAAAGACT CCTGTAACCC TCC T CCAGGATGAACCACC T GCCAGAAGACATGGAGAACG 
C TC TCACCGGGAGCCAGAGC T CCCATGC T T CT CTGCGCAAT ATCCAT T CCAT CAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCAT T GAGAACACAT TAGAGAAGGAGGTGAT G CAGTATGAC T AC TATT CT T CATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCC T GGAT TGAGACGTGGT TCCTGGAT T T CAAAGT GT T ACC TCAAGAAGCAGAAGAAGAAAA 
CAGACT CCT GATAGT TCAGGAT GC T T CAGAGAGGGCAGCACT TATACCT GGTGGTC T T TC T GAT G 
GTCAGT T T TAT TCCCC T CCT GAATCCGAAGCAGGAT CT GAAGAAGCT GAAGAAAAACAGGACAGT 
GAGAAACCAC T T T TAGAAC T ATGA GTACT ACT T TT GT T AAAT GTGAAAAACCC TCACAGAAAGT C 
ATCGAGGCAAAAAGAGGCAGGCAG T GGAGTCTCCC T GTCGACAG TAAAGT T GAAAT GGTGACGTC 
CACTGCT GGCT TTAT TGAACAGCTAATAAAGAT TTAT TTATTGTAATACCTCACAAACGTTGTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGAC T GAGCCT GAT GT GT T AAC AAAT AGG T GAAG AAAG T C T T GT GCTGTAT T CC TAATC 
AAAAGACT TAATATAT T GAAGTAACACT T T TT TAGTAAGCAAGATACCT T T T TAT TT CAAT TCAC 
AGAATGGAAT TTTTTTGTTT CATGTC TCAGAT T TAT T T TGTAT T TC T T T T T TAACAC T CT ACAT T 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
C TT GCAAAAGGGGAAGAAAGGAAT T GCGAATACATGTAAAAT GTCACCAGACAT T TGTAT TAT TT 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAAT GAC T TAAACCAT TCATATCAT GT TTCCT T T GCGT TCAGCCAAT TT CAATTAAAAT GAA 
CTAAATTAAAAA 



FIGURE id 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWIIELNVNGGIENTLEKEVMQ 

AVTSAFLIJ^ILSKLFSQGAFGYVLPIISFIIAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
§.,*. CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
2t TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
O CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
Cfl TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
111 TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
|f| TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
S« CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
9 ' CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
#=* ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
X GGCC TGTAC T GT GAGAGCCAGATGGGGCAGGGGACAC GGCCCAGCCC TACACCAGTCACGCC GAG 

GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
O AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
|J GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
f*% GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 

GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCA.ee 
111 CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCT GTG T GAC T C TAGTCT T GGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 
T TTAGGAACATGTTT T GCTT TT TTAAAATATATATAT TTATAAGAGAT CCT TTCCCATT TATTCT 
GGGAAGATGTT TTTCAAACT CAGAGACAAGGACTTTGGTTT TTGTAAGACAAACGATGATATGAA 
GGCC T T T T GTAAGAAAAAATAAAAGATGAAGT GTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDT7\JSTVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGIAALQELDVSNLSLQALPGDLSGLFPRLRLIAAARNPFNCVCPLSWFGPWTOESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLG IEPVS PT SLRVGLQRYLQGS S VQLRS LRLT YRNLS GPDKRLVTLRLPAS LAE Y 
J:J TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
*n LIAAIAAVGAAYCTORGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATE 
|y CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 

-fees* 

iff Transmembrane domain: 

1*1 amino acids 579-599 

'i - I; 

v ~' ;: EGF-like domain cysteine pattern signature. 

ffz amino acids 430-442 

Leucine zipper pattern. 

amino acids 197-219, 269-291 

N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAAT CC TT AGAT T CCAAGACT AC TT TGACAT CAGAT GAGT CAGTA 
AAGGACCATACTACTGCAGGCAGAGTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGA 
AT TAGAAT CCT CTAT TCAAGAAGAGGAAGACAGCC TCAAGAGCCAAGAGGGGGAAAGTGT CACAG 
AAGATAT CAGC TT TC TAGAGTC TCCAAAT CCAGAAAACAAGGAC TATGAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
H : T T T CCTAGATAAGGAGT AT GAT GAATGTACAT CAGAT GGGAGGGAAGAT GGCAGAC TGTGGT GT G 

|:f C TACAACC TAT GAC TACAAAGCAGATGAAAAGTGGGGC T T TT GTGAAACTGAAGAAGAGGCT GCT 

IK AAGAGACGGCAGATGCAGGAAGCAGAAAT GATGTAT CAAACT GGAAT GAAAAT CC T TAAT GGAAG 

ti| CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAA71AGGCAGCAAGCATGAACCATA 

01 CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 
W GCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGCTCTTGG 

CTTTCTGTATGCCTCTGGACTTGGTGTT7\ATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 

SB 

f \ T TGGAGCT CT T GGGGGCAATC TAATAGCCCACATGGTTT T GGTAAGTAGAC T T TAGTGGAAGGC T 

Iff AAT AAT AT TAACATCAGAAGAAT TTGTGGTT T ATAGCGGCCACAAC T T T T TCAGC T T TCAT GAT C 

Q CAGATTTGCT TGTAT TAAGACCAAATAT TCAGT TGAACT T CCT TCAAATT C T TGTTAATGGATAT 

2 AACACATGGAATCTACATGTAAATGAAAGTTGGTGGAGTCCACAATTTTTCTTTAAAATGATTAG 
St TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 

GAAT TGTC AGAATCAT TT TT TACAT T AGAT TAT CAT AAT T T TAAAAAT TT T TC T T TAGTT TT TCA 
AAAT TT TGTAAATGGTGGCTATAGAAAAACAACATGAAATATTATACAATATT TTGCAACAATGC 
CCTAAGAATTGTTAAAATTCATGGAGTTATTTGTGCAGAATGACTCCAGAGAGCTCTACTTTCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT T AAC T CAT T T T T AAT AAAAT TAT G T C TAAG AT T AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWC^TTYDYKADEKWGFCETEEEAAKRRQM 

EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S S QAKALVY YT FGALGGNL I AHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites . 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myris toylation si tes . 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 

AAT T CAGAT T T TAAGCCCAT T C TGCAGT GGAAT T TCATGAACTAGCAAGAGGACACCATC T T CT T 
GTAT TAT ACAAGAAAGGAGTGTACC TAT CACACACAGGGGGAAAAATGC T C T T T TGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACC TCAGAGAGACT TCGTAC TGTGCTT CTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
M' C CCGGCGT GCTGGCT CCCACT GAC T GGCT GACACTAGAGGAC TACAGAGAACCTATT GAAGT GAA 

y CCT GT TT GGAC TCAT CAGT GT GACAC TAAATATGCT T CCT T T GG TCAAGAAAGC T CAAGGGAGAG 

Jf TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
|gj TATGCAGT GGAAGGT T TCAATGACAGCT T AAGACGGGACATGAAAGC T TT TGGT GTGCACGTCT C 

Iff ATGCAT T GAACCAGGAT T GT TCAAAACAAACT TGGCAGAT CCAGTAAAGGTAAT TGAAAAAAAAC 

ft| T CGCCAT T TGGGAGCAGCT GTC TCCAGACATCAAACAACAAT AT GGAGAAGGT TACAT TGAAAAA 

AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 
^ CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
ff | TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
Q GCAGAGCTGGCTAATCCCAAGGCAGTG TGA CTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
III AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 

CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 

tit 

CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCT C T TAT C TAAAT AT TAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
S TALKAET SERLRT VLLDVTDPENVKRTAQWVKNQVGEKGLWGL INNAGVPGVLAPTDWLTLEDY 
REPIEWLFGLISVTLNMLPLVKKAQGRVINVSSVGGRI^IVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVWSCIEPGLFKTNLADPVKVIEKKLAIWEQLSPD^ 

LSPWECMDHALTSLFPKTHYAAGKDAKI FWI PLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
N Signal peptide: 

::: amino acids 1-17 

m 

III Transmembrane domain: 

VI amino acids 136-152 

N-glycosylation sites. 

**% amino acids 161-163, 187-190 and 253-256 

m 

Glycosaminoglycan attachment site. 

2' amino acids 39-42 

f 'i II 

t M 

N-myristoylation sites . 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATCGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGAT GATGTTGAAGT TGTT TATACAAT TGA 
CAT TCAGAAATATAT T CCATGC TAT CAGCT T T T TAGC T T TT ATAATTC TT CAGGCGAAGTAAATG 
AGCAAGCACTGAAGAAAATAT TATCAAAT GT CAAAAAGAAT GT GGTAGGT T GG TACAAAT TCCGT 
CGTCAT T CAGATCAGATCAT GAC GT T TAGAGAGAQGC TGC T TCACAAAAAC TTGCAGGAGCAT T T 
TTCAAACCAAGACCT T GT TTT TCTGCTAT TAACACCAAGTATAATAACAGAAAGCTGCTCTACTC 
\* ATCGACT GGAAGAT T CCT TATATAAACCT CAAAAAGGACT T TT TCACAGGGTACC TT TAGT GGT T 

y GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
irk TTTTAGCCGAGCAGTACAAACACACAGCTCTAAATTTTTTGAAGAAGATGGATCCTTAAAGGAGG 
hi TACATAAGATAAATGAAAT GTAT GC T T CAT TACAAGAGGAAT TAAAGAGTATATGCAAAAAAGTG 

Ml GAAGACAGT GAACAAGCAGTAGATAAAC TAGTAAAGGAT GTAAACAGAT TAAAACGAGAAAT T GA 

m ■ 

*ll GAAAAGGAGAGGAGCACAGAT TCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCC T CAGGAGA 

"J*' ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
Q ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
|fl AGACAAT CT GACCT T AATGGTAGAACACAC TGACAT TCCT GAAGCTAGT CCAGC TAGTACAC CAC 

fcl AAATCAT TAAGCATAAAGCCT TAGAC T TAGAT GACAGATGGCAATT CAAGAGAT CTCGGTT GT TA 

GATACACAAGACAAACGAT CTAAAGCAAATAC TGGTAGTAG T AACCAAGATAAAGCAT C CAAAAT 

W % 

J-H GAGCAGCCCAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGTGAATATTCACGGTCTC 

CTACAT T T TGAT CC TTT TAACC T TACAAGGAGAT T T T T T TAT TT GGCT GAT GGG TAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
T G T T T G C AG T AATAC AC AG AT AAC T C T TAG T GC AT T T AC T T C AC AAAG T AC T T T T T C AAAC AT C A 
GAT GC T T T TAT T TCCAAACC T T T T T T TCACCTT TCAC T AAGT TGTTGAGGGGAAGGC T TACACAG 
AC ACAT T C T T T AGAAT T G G AAAAG T GAGAC C AGG C AC AG T GG C T CAC AC C T G T AAT C C C AG CAC T 
TAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCCTGGGCAACGTATT 
GAGAC CAT G T CTAT TAAAAAATAAAATGGAAAAGCAAGAATAGCCT TAT TTT CAAAAT AT GGAAA 
GAAAT T TATAT GAAAATT TAT C T GAGTCAT TAAAAT T C TCCT TAAGTGATAC TT TT T TAGAAGTA 
CATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCAATAAATTTGCAAAACATCATCT 
AAAATTTAMAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEWEQALKKILS]WKKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPS 1 1 TES CS THRLEHS LYKPQKGLFHRVPLWANLGMS EQLGYKTVS GS CMS TG FSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEE I EKMKGFGE YSRSPT F 

Important features : 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



a 

Q 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TGCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
CAAGAGATGGAGGAGAAGTACAGCCT G CAAAGTAAAGAT GAT TTCAAAAGT GT GGT CAGCGAACAGTGCAAT CAT TT G 
CAAGCT GT CTTT GCTT CACGT T ACAAGAAGTTT GAT GAAT T CTTCAAAGAACTACT TGAAAAT GCAGAGAAATCCCTG 
AAT GATATGTTT GTGAAGACATATGGCCATTTATACAT GCAAAAT T CT GAGCTATT TAAAGAT CTCTT CGT AGAGTT G 
! : g AAACGT TACTACGTGGTGGGAAATGT GAAC CT GGAAGAAAT GCTAAAT GACT T CT GGG CT CGCCT CCT GGAGCGGAT G 

5 || TTC CGCCT GGT GAACT C CCAGTACCACT TTACAGAT GAGTATCTGGAAT GT GT GAGCAAGTAT ACG GAGCAGCTGAAG 

ff% CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
f" T TAGCGGTTGCGGGAGAT GT CGT GAGCAAGGT CT CCGT G GTAAAC C C CACAGC C CAGT GTAC CCAT GCCCT GT T GAAG 

s ' ATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAGAGGC 
f*% T GTTTGGC CAACCAAGGG GAT CT CGATTTT GAAT GGAACAATTTCATAGAT GCTAT GCT GAT GGT GGCAGAGAGG CT A 

m GAGGGTCCTTTCAAC^TTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 
Q AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
| E § T CCAT CTC T GAAAGT GCCT TCAGT G CTCG CTT CAGACCACAT CACC CCGAGGAACGC C CAAC CACAGCAGCT GGCACT 

Q AGTTT GGACCGACT GGTTACT GAT GT CAAGGAGAAACT GAAACAGG CCAAGAAATT CT GGT CCT CCCT T CCGAGCAAC 

fU GTTTGCAACGATGAGAGGATGGCTGC^GGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTAC 

CTGTTTGCAGT GACAGGAAAT GGATTAGC CAACCAGGG CAACAACC CAGAGGT CCAGGTT GACACCAGCAAACCAGAC 
ATACT GAT CCT T CGT CAAAT CAT G GCT CTT CGAGT GAT GACCAGCAAGAT GAAGAATGCATACAATGGGAACGAC GTG 
GACTT CTTT GATAT CAGT GAT GAAAGTAGT GGAGAAGGAAGT GGAAGT GGCT GT GAGTAT CAGCAGTGC C CTT CAGAG 
TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 
CAGGCCTACCT CCTCACT GT C TTCT GCAT CTT GTT CCT GGTTAT G CAGAGAGAGT GGAGATAATT CTCAAACTCT GAG 
AAAAAGT GTT CATCAAAAAGT TAAAAGG CACCAGTTAT CACTTTT CTACCATC CTAGT GAC TT T GCTTTTTAAAT GAA 
TGGACAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAGGGACTGT GCATT GAGTT GGTT CCT GCTCCCC CAAACCAT GT TAAACGT GGCTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
TCT GAAAT ATT AAATAGCT GTACAGAAGCAG GT T T TAT TTATCAT GTTAT CTTATTAAAAGAAAAAGC CCAAAAAGC 
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FIGURE 24 

MARFGLPALLCTIAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 

CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 

LYMQNSEL FKDL FVELKRY YWGNVNLEEMLNDFWARLLERMFRLVNS QYHFTDE YLECVSKYTE 

QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSWNPTAQCTHALLKMIYCSHCRGL 

VTVKPCYNYCSNIMRGCIANQGDLDFEWNNFIDAMIMVAERLEGPFNIESVMDPIDWISDAim 

MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 

EKLKQAKKFWSSLPSWCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGIANQGNNPEVQV 

KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 

KSANEKADSAGVRPGAQAYLLTVFC ILFLVMQREWR . 

Important features : 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . * 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
C TGAGTCATCCCCAGGGAT CAGGAGCCT CCAGCAGGGAACCT TCCAT TATATTC T TCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCACTC T TC CAAT TAAACAT T CT CAGCCAAGAAGACAGT GAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCT TACCCAGGCT TAGGCT TAAT TACC TGAAAGATTCCAGGAAACTGTAGCT TCCT 
AGCTAGTGTCATTTAACCTTAAATGCAATCAGGAAAGTAGCAAACAGAAGTCAATAAATATTTTT 
AAAT GT C AAAAAAAA&AAAAAAAAA 
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FIGURE 26 



MKVL I S S LLLLL PLMLMSMVS S S LNPGVARGHRDRGQASRRWLQE GGQE CECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
H GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
H C T C T GT G C AGAGC C CAAG GAAGG T CC T G CAGGAGG T CC G GAGAG T AC T GAG AC C GGGAGG T G T GC 

o 

#p TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 

|y GAGCCCACCTGGAAACACAT TGGGGAT GGCT GC TGCCTCACCAGAGAGACC TGGAAGGAT C T TGA 

Ifl GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
G GC C CC ACAT C AT GGGAAAGGC T G T C AAAC AAT C T T T CC C AAGC T C CAAG G CAC T CAT T T GCT C C 

r 

* p TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 

?H GCAGAAT GAGAGAAGACATTCATGTACCACCTAC TAGTCCCT CT CT CCCCAACC TCTGCCAGGGC 

W — 

|f| AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
C3 AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
|Jf CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
S's CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 

CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGACTCAAAGAGGCAGAGGTT TTGT TCTCAAATAT TT TT TAATAAATAGACGAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALL^ 

TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQI^GSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQP I YLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCT ATG CTGCTGCT7VA.CGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGT TTATGATTCT TATACTAATT TATACAAAGATATTAAGGCCCT GTTCATTAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE so 



MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPi\MTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

|!~ amino acids 3-18 

in 

h! 
If! 
CO 



0 
W 

c:i 
ru 
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FIGURE 31 



JUL 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CGTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTC TGCGGT GT TT TATGGT GGC TGTAT TAT GACTATACCAACGACC T CAGCATAGAATT GGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
H C TGCTCGTCTTGATTT TT GTTCTCAGAAAGAGAATAAAAT TGACAGT TGAGCTTT TCCAAATCAC 

)z AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 

TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
|J ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
Ul AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
W TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 

9-'' 

TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 

3E 

#** GATTCCGAGAATCATTGTCATGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGT 
111 CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 

'S .:- •: 

AACCAGAAT GCATATACTACAACTGC TAT T AAT GGGACAGATT T CT G T ACAT CAGCAAAAGATGC 
ATTCAAAATCT TGTCCAAGAAC TCAAGTCACT TTACATCTATTAACTGCT TTGGAGACTTCATAA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAACAAATGATGGAT CGT CAGAAAAGCCC TACT T TAT GGAT CAAGAAT T TC T GAGTT T CGTA 
AAAAGGAG C AACAAAT T AAAC AAT G C AAG GGCAC AGC AG GACAAG CAC T CAT TAAG GAAT GAG GA 
GGGAACAGAAC TCCAGGCCAT TGT GAGATAGATACCCAT T T AGGTAT C TGTACC TGGAAAACAT T 
TCCTTCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGAGAAAAGTTAGTGAATTTTTTT 
T TAAAAGACCTAATAAACCCTAT T C T TC C T CAAAA 



App_ID=l 0063584 



Page 181 of 318 



FIGURE 32 

MSGRDT I LGLC ILALALSLAMMFT FRFI TTLLVHI FISLVILGLLFVCGVLWWL YYDYTNDLS IE 
LDT ERENMKCVLG FAI VS TGI TAVLL VL I FVLRKR I KLTVEL FQ I TNKAI S SAP FLL FQPLWT FA 
ILIFFWVLWAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWT C Y FNRS KNDP PDHPILSSLSILF FYHQGT WKGS FL I S WR I PR I IVMYMQNALKE QQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FIIFLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains : 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATT T GT ATGGGAT T T GT GATGCAGGAAAGCCTAAGGGAAAAAGAATAT T CAT TCTGTGTGGT 
GAAAAT T TT T TGAAAAAAAAAT TGCC TT CT TCAAACAAGGGT GTCAT T C T GATAT T T ATGA GGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT T C AAAC AAAG AAA3 G G C AAAGAAG AT T AAAA.GG C C C AAG T T C AC T G T G C C T C AGAT C AAC T G C 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGT GGT G TGC T T GATAAT TCAGGAGGGAAAATACT TGT T CGGAAGGTT GC T GGACAGTC T 
GGTTACAT^AGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT 
CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 
p ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
fZ CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
\£ GCCACAGGAGCCAGGAGAT GGAT CTC TGGTCCACT GCCACC TACACAAGCAGCCAAAACAGGCCC 

f\ AGAGCTGATCCAGGTAT CCAAAGGCAAGATCCT TCAGGAGC T GC CT TC CAGAAACC T GT T GGAGC 

GGATGTCAGCCTGGGM:TTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCC 
|p TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA 
fd CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
*J* CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 
ACACGAAT TCT CGAGAT C T GAAGACAGCCATAGAGAAAAT TACTCAGAGAGGAGGAC T TTCTAAT 
a GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
■Q GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
|H GAC T TGC G AG AGAGT C AG G AAT C AAC AT T T T C T T CAT C AC CAT T GAAG GTGCTGCT GAAAAT GAG 

'%l AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
H GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
ill ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
p GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
fll AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 

TGGAGTT TGGGT TCGACAAGTACAGCAGCAAGCCTGACAT CCTCAACGCCATCAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
GTT TGACAACC T CCAT CAGTAT GTCCCCAGGATCATC CAGAACATT TGTACAGAG TTCAAC T CAC 
AGCCTCGGAAC TGAA T TCAGAGCAGGCAGAGCACCAGCAAGTGC TGCT TTACTAACTGACGTGT T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
T GATCACAAACGTATAGAATGAGCCAAAAGGC TACAT CAT GT TGAGGGTGC TGGAGATT TT ACAT 
T T T GACAAT T GT T T TCAAAATAAATGTT CGGAATACAGT GCAGCCCT TACGACAGGC TTACGT AG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T TGTCAT GACAATG TAGGAAT TGC TGAATTAAAT GT T TAGAAGGAT GAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA?\AAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE <\± 

MRTWLTMKAS VI EMFLVLLVTGVHSNKETAKK IKRPKFT VPQ INCDVKAGKI I DPE FI VKCPAG 

CQDPKYHVYGTDVYAS YS SVCGAAVHSGVLDNS GGKI LVRKVAGQS GYKGS YSNGVQSLSLPRWR 

ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 

TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 

VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 

GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 

RSGAPNVVVVMVDGWPTDKVEEASRIJ^ESGINIFFITIEGAAENEKQYWEPNFANKAV 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 

LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 

FKKS KPNKRKLM I L I TDGRS YDDVR I PAMAAHLKGVI TYAI GVAWAAQEELE VI ATHPARDHS FF 

VDE FDNLHQYVPRI I QNI CTEFNS QPRN 

Important features : 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites . 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE as 



CP 



§ « ii 

o 
ru 



CCGAGC AC AGGAGAT TGCCT GCGT TT AGGAGGT GGCTGC GT T GTGGGAAAAGCT AT C AAGGAAGAAAT TGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CT C AGCCT TCCC C ACTAC AAT GTG AT AGAACGCGTGAACTGGAT GT AC TT CT AT GAGT AT GAGC CGATTT A 
CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC 
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
TGGTGG GGAT AT GAGGT T CT TACATT T TTCTT ATT AGGCC AAGAGGCT GAAAAGGAAGAC AAAATGTT GGC 
AT T GT C CT T AGAGGATGAACACCT TCT TT ATGGTGACAT AAT CCGACAAGAT TT TT T AGACACAT AT AAT A 



| i ij ACCT GACCT TGAAAAC C ATT ATGGCAT TCAGGTGGGT AACT G AGT TT T GCCCCAAT GC CAAGT ACGT AAT G 

4fj AAGAC AGACACT GATGTT TT C AT C AAT ACT GGCAAT TT AGTGAAGT AT CT TT T AAACCT AAACC ACTC AGA 

IP GAAGTTTTTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT 
«f» CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
^ TTGGTGCC AAGGAT CT AT GAAATG AT GGGT CACGT AAAACCC AT CAAGTT T GAAGAT GT T TATGT C GGGAT 

CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC 
ATT T GGAT GT CT GT CAACTGAGAC GT GTGATT GCAGCCC AT GGCTTTT CT T CCAAGGAGATC ATC ACT TT T 
TGGC AGGTCAT G CT AAGGAACACC AC ATGCCAT TAT TAACT T CACATTCT AC AAAAAGCCT AGAAGGAC AG 
GAT ACCTTGT GGAAAGTGTT AAAT AAAGT AGGT ACT GT GGAAAAT TC ATGGGGAGGT C AGT GTGCT GGCT T 
AC AC TGAACT GAAACT CATG AAAAAC C C AGACT GGAGACTGG AGGGT T AC ACT T GT GATT T ATT AGTCAGG 
CC CT TCAAAG AT GAT ATGTGGAGGAAT TAAAT AT AAAGGAAT TGGAGGTT TTT GCT AAAGAAAT T AAT AGG 
ACC AAACAAT TT GG AC AT GT CATT CT GT AGACT AGAAT TT CT TAAAAGGGTGT T AC TGAGTT AT AAGCT C A 
CT AGGC TGT AAAAACAAAACAATGT AGAGT TT T AT TTAT TGAAC AAT GTAGTC ACT T GAAGGT TTTGT GT A 
TATCTT AT GT GGAT T AC C AATT T AAAAAT ATATGT AGT TCT GTGTCAAAAAACT TCT TCACT GAAGTT AT A 
CTGAAC AAAATT TT ACCT GT T T T T GGT C ATTT AT AAAGT ACT TCAAGATGTTGC AGT ATT TC AC AGTT AT T 
ATT ATT T AAAAT TACT TC AACT TT GTGT T T TT AAAT GT TTTG ACGATT TC AAT ACAAGAT AAAAAGGAT AG 
TGAATC AT TCTT T ACATGC AAACATTTTCC AGTT AC TT AACT GAT CAGTT T AT T AT TGAT AC ATC ACT CCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AATAT AGAGAAGAATT AAAGCAAGAAAAT C TGAAAA 



AppJD=10063584 



Page 185 of 318 



FIGURE 36 

MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYWIERVNWMYFYEYEPIYRQDFHF 
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLYKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KP I KFEDVYVG ICLNLLKVNI HI PEDTNL FFL YRI HLDVCQLRRVI AAHGFS S KE 1 1 T FWQVMLR 
NTTCHY 



Q Important features: 

U Type II transmembrane domain: 

m 

™" ammo acids 20-39 

I s | 

if 

: hr I: 

fz% N-glycosylation sites. 

S amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

<% 

Tl Glycosaminoglycan attachment site. 

'%L amino acids 239-243 

Q Ly-6 / u-PAR domain proteins. 

ammo acids 23-37 



N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGGCGCGGCAAG GATGG AGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCT TGCCAAGAGAGTACAC AGTCAT TAAT GAAGCCTGCCC TGGAGCAGAGT GGAATATC ATGT GTCGGGAGTGCTGTG 
AATATGATCAGATTGAGTGCGTCTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAA 
TGAGTGTGACTCC TGC C TGATCC ACCC AGGT TGTAC CATC T T TGAAAAC TGC AAGAGC TGC CGAAATGGCT CATGGGGGGGT 
ACC T TGGATGAC T TC TATGTGAAGGGGT TCT AC TGT GCAGAGTGCCGAGCAGGCTGGTACGGAGGAGAC TGCATGCGATGTG 
GCC AGGT TC TGC GAGC CCC AAAGGGTCAGAT TT TGTT GGAAAGC TATCCCC TAAATGCT CACTGTGAATGGAC CATTCATGC 
T AAACCTGGGT T TGT CATC CAAC TAAGAT TTGTCATGTTGAGTC T GGAGTT T GAC TACATGTGCCAGT ATGAC TATGTTGAG 
GTTCGTGATGGAGACAACC GC GATGGCCAGATC AT C AAGC GTGTC TGTGGC AACGAGCGGC CAGC TCC TATC CAGAGCATAG 
J*£> GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
I J ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
fj TATAC TGGGCAGCGC T GTGAAAATCTCC T TGAAGAAAGAAACT GC TCAGACC CTGGGGGCC CAGTCAATGGGTAC CAGAAAA 

f|| TAACAGGGGGCC C TGGGCT TATCAACGGACGCCATGC TAAAATT GGCAC CGTGGTGTCT TT C T T T TGTAAC AACT CC TATGT 

1 5 I TCT TAGTGGCAATGAGAAAAGAAC TTGCCAGCAGAATGGAGAGTGGTCAGGGAAACAGCCCAT C T GC ATAAAAGC CT GCCGA 

1 11 GAACCAAAGATT TCAGACC TGGT GAGAAGGAGAGT T C TTCCGATGCAGGT TCAGT CAAGGGAGACACCATTACAC CAGC TAT 

ti\ AC TCAGCGGCC T TCAGC AAGCAGAAACTGCAGAGT GC CCC TACC AAGAAGC CAGC C C T TCCCT T T GGAGATC TGC CCATGGG 

P ATACCAACATCTGCAT ACCC AGC T CCAGTAT GAGT GC ATC T CAC CCT TC TAC CGC C GCC TGGGC AGCAGCAGGAGGACAT GT 

C TGAGGAC TGGGAAGTGGAGTGGGCGGGCAC C ATCC T GCATCCC T ATC TGCGGGAAAAT TGAGAACATCAC T GCTCCAAAGA 
C CCAAGGGT TGC GCTGGCCGTGGC AGGCAGC CATC TACAGGAGGACC AGCGGGGTGC ATGACGGCAGCCTACACAAGGGAGC 
; si GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
^ GTC ACCATGATCAAGACAGC AGACC TGAAAG TTGT T T TGGGGAAATTC TACC GGGAT GATGAC C GGGAT GAGAAGACCATCC 

Tl AGAGCCTAC AGAT T T C T GC TATC ATTC TGCATCCCAACTATGACC CC ATCCT GC T TGATGC TGACATC GC CAT CC TGAAGCT 

Tf CC TAGAC AAGGC C CGTATC AGCACCCGAGTCCAGC C CATC TGCC TCGC TGC CAGTCGGGAT CTCAGCACTT CC TTC CAGGAG 

Zf T CCCACATC ACT GTGGC TGGC TGGAATGT CC TGGCAGACGT GAGGAGCCCTGGC T TCAAGAACGACAC ACT GC GC TC TGGGG 

rs ii 

- TGGTCAGTGTGGTGGAC TCGC TGC TGT GT GAGGAGC AGCATGAGGACCATGGCATCCCAGT GAGTGTCACT GATAACATGTT 

CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
CGAGC ATC TCCT GAGCCACGC TGGCAT C T GATGGGACTGGTCAGC TGGAGC TATGATAAAACATGCAGC CACAGGC TCTCCA 
CTGCCTTCACCAAGGTGCTGCCTTTTAAAGACTGGATTGAAAGAAATATGAA ATGAA CCATGCTCATGCACTCCTTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGATGCCGCGTCCA 
C TACTAGGACAGCCAAT TGGAAGATGCCAGGGC TTGCAAGAAGTAAGT T TCT TCAAAGAAGACCATATACAAAACC TC TCCA 
C TCCACTGACC TGGT GGTC T T CCCCAAC T TT CAGTTATAC GAATGCCATCAGCT T GACC AGGGAAGATC TGGGCTTC ATGAG 
GCCCC TT T TGAGGCT CT CAAGT TC TAGAGAGCTGC C TGTGGGAC AGC CCAGGGCAGCAGAGCT GGGATGTGGT GCATGCC TT 
TGTGT AC ATGGCCACAGTAC AGTCTGGTC C T TT TC C TTCCCCATC TC T TGTACACAT TT TAATAAAATAAGGGTT GGCT TCT 
GAAC TAC AAAAAAAAAAA?WVAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQ I LLE S YPLNAHCEWT I HAKPG FVI QLRFVMLS LE FD YMCQYD YVEVRDGDNRDGQI I 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKA.GSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHT QLQYEC ISP FYRRLGS SRRTCLRT GKWS GRAPSC I P I CGKI ENI TAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVWAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKT I QS LQ I S AI I LHPNYDP I LLDAD I AI LKLLDKAR I S TRVQP I CLAAS R 
DLSTSFQESHITVAGWWUU)VRSPGFKNDTLRSGVVSVVDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVS FPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide; 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Ami da ti on site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 2Q 



GGTT CCTACAT CCTCT CAT CT GAGAATCAGAGAG CATAAT CTT CTT AC GGGCC C GT GAT TT AT T AACGTGGCTTAATC 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGA ATGA AGGCGCTTCTGTTGC 
T GGT CTT GCCTTGGCT CAGT CCT GCT AACTACATT GACAAT GT GG GCAACCT G CACTT CCT GTATT CAGAACTCT GTA 
AAGGT GCCT CC CACTACG GCCT GACCAAAGATAGGAAGAGGCGCT CACAAGAT GGCTGT C CAGACG GCT GT GCGAGCC 
T CACAGCCACGGCT C CCT C CCCAGAG GTTT CTGCAGCT GCCAC CAT CTCCTTAATGACAGACGAGC CT GGCCTAGACA 
ACC CT GCCTACGT GT CCT CGGCAGAG GAC GGGCAGC CAGCAAT CAGCCCAGTGGACTCT GG CCGGAGCAACCGAACTA 
I ^ GGGCAC GGCCCTTTGAGAGAT CCACTATT AGAAGCAGAT CATTT AAAAAAATAAATCGAGCTTT GAGT GTT CTT CGAA 

f K \ GGACAAAGAGCGGGAGTGCAGTT GCCAAC CAT G C CGACCAGGGCAGGGAAAAT T CT GAAAACACCACT GC CCCT GAAG 

Q TCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTACCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAA. 
|f| GCCT CT CTATTAG GCT GGT GGGAGGTAGC GAAACCCCACT GGT C CAT ATCAT TAT C CAAC AC AT TTAT C GT GAT GGGG 

||j TGATCGCCAGAGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTC^ 

Iff ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
jfj GCAGGAACAAT GGACAGGCCC CGGAT GCCTACAGAC CCC GAGAT GACAGCTTT CAT GTGAT TCT CAACAAAAGTAGCC 

S CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 

t - 

9 GTGTGGCATATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCGATATGGCA 
Q GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
|f| G GAGC CCT GACAT CTT TCAGGAAG CCGGCT GGAACAGCAAT GGCAGCT GGT CC CCAGGGC CAGG GGAGAGGAGCAACA 

jQ CTCCC&AGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGT^ 

|y GCAT GACCGTCGCAGG GGGAGCAT CACATAGAGAAT GGGATTT GCCTAT CTAT GT CAT CAGTGT TGAGCCCGGAGGAG 

TCAT AAGCAGAGAT GGAAGAATAAAAACAGGTGACATTTTGTT GAAT GT GGAT GGGGT C GAACT GACAGAG GT CAGC C 
GGAGT GAG GCAGT GGCATTAT TGAAAAGAACAT CAT CCT CGATAGTACT CAAAGCT TT GGAAGT CAAAGAGTAT GAGC 
CCCAGGAAGACTGCAGCAGCCCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCT 
GGGT CATGT GGCT GGAATTACCAC GGTG CT T GTATAACT GTAAAGATATTGTATTACGAAGAAACACAGCT GGAAGTC 
T GGGCTTCT GCAT T GTAGGAGGTTATGAAGAATACAAT GGAAACAAAC CTT TTTT CAT CAAAT CCAT T GTT GAAGGAA 
CAC CAGCATACAAT GAT GGAAGAAT TAGAT GTGGTGATAT T CTT CTT GCTGT CAAT GGTAGAAGTACAT CAGGAATGA 
TACAT GCTT GCTT GGCT^AGACT GCT GAAAGAACTTAAAGGAAGAATTACTCTAACTATT GTTT CTT GGC CT GGCACTT 
TTTT ATAGA ATCAAT GAT GGGT CAGAG GAAAACAGAAAAAT CACAAATAGGCTAAGAAGTT GAAACACTATAT TTAT C 
TTGT CAGTT TTTATATTTAAAGAAAGAATACATT GTAAAAATGT CAGGAAAAGTATGAT CAT CTAATGAAAGCCAGTT 
ACACCT CAGAAAATAT GATT CCAAAAAAAT TAAAACTACTAGTTTTT TTT CAGT GT GGAGGAT TT CT CATTACT CT AC 
AACATT GTT TATATTTTTT CTAT T CAATAAAAAGC CCTAAAACAACTAAAAT GAT TGATT T GTATACCC CACT GAATT 
CAAG CT GAT TTAAATTTAAAATTT GGT ATAT GCT GAAGT CT GC CAAGGGTACAT TATGGCCAT TTTTAATT TACAGCT 
AAAATATTT TTTAAAAT GCAT T GCT GAGAAACGTTGCTTTCAT CAAACAAGAATAAATATTTTT CAGAAGT T AAA 
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FIGURE do 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT ISLMTDEPGLDNPAYVS SAEDGQPAI SPVDSGRSNRTRARPFERS T IRSRS FKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHI I IQHIYRDGVIARDGRLLPGDI ILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDS FHVILNKSSPEEQLGIKLVRKVDEPGVFI FNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEPCVWIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDI LLAVNGRS T SGMIHACLARLLKE LKGRITLT I VS WPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myr i s toyl ation s i tes . 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 

ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 

CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATCGGCTTCAACCTGACT 

TTCCACCTTTCCTACAAATTCCGATTACTGT.TGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 

CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 

ATAAGACCCT C AT T T T G GG GAAGGGAAAAACT C TGAC TAAT GAAGCAT CCAC GAAGAAGGT AG AAC T T 

GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 

T TT G GAAGAG G T ACAGGCAGAAAATC C C AAAG T GT C C AGAGGC C GGT AT CGC C CT C AG GAAT GTAAAG 

CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 

CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 

TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 

ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 

CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 

GGGTGTTACTGCC CTAAGCAGAGAGCAGTTT TT CAAGGT GAAT GGAT TCTCTAACAACTACTGGGGAT 

GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 

CCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 

GATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT 

T AGT AT C T G T GGAACACAAT C CT T TAT AT AT CAAC AT CAC AGT GGAT TTCTGGTTTGGTG CATGACC C 

TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGGCTAGAGACTTCAA 

ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 

TAGCAGAGCTCCTGGTGATGTAGAGTATAAAACAGT T GTAACAAGACAGCTT T CTTAGTCATTTT GAT 

CAT GAGGGTTAAATATTGTAATAT GGAT ACTTGAAGGACT T TATATAAAAGGATGACTCAAAGGATAA 

AATGAACGCTAT TTGAGGACT CT GGTT GAAGGAGAT TTAT TTAAATT TGAAGTAATATAT TATGGGAT 

AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCA.GAGTTGTTCTCGTCCAAGGTAGAA 

AGGTACGAAGAT ACAATACT GTTAT TCATTTAT CCTGTACAAT CATCTGTGAAGTGGTGGT GTCAGGT 

GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGT GAACTT GGGAATGAAGA 

GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 

CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 

TTTTAAAGAGTTTTTGTAAAATGATTTTGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACAT 

AT T AAC T AAT AAT AAAT AT GT CTAT CAAATAC C T CTGT AGTAAAAT G T GAAAAAG CAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites . 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 



GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCTGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCA?^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA2^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



III 

m 
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FIGURE 44 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQI^LQPQDRAGARA.SWMPMFQRRRRRDTHFPI 
C I FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site, 
amino acids 1-12 
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FIGURE A* 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAAT ATG GCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
AT CGTAATAGGGAGAGAGT AGAC TT CCCAGAT GGAGGCTAC TCCC TGAAGC TCAGCAAACT GAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGT ACGT GC TGCATGT C TACGAGCACCT GTCAAAGCCTAAAG TCACCAT GGG TCTGCAGAGCA 
ATAAGAAT GGCACCTGTGT GACCAATCT GACAT GCT GCAT GGAACATGGGGAAGAGGAT GT GAT T 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GTTTCTGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGG 
AAAC TC CTAACATATGCCCCCAT TCT GGAGAGAACACAGAGT ACGACACAAT CCC TCACACTAAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGAT 
GGTW^TCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
TC TAGACAGCAGT GCAC TCCCCTAAGT CTCT GC T CA 
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FIGURE 46 

MAGSPTCLTLIYILWQLTGSi^SGPVKELVGSVGGAVTFPLKSKVKQVDSIWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRN FS S P I LARKLCEGAADDPDS SMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEE Y I E 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 

PRLFAYENVI 



Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 



Leucine zipper pattern. 

If! 

f*% amino acids 229-251 

III 

13 N-glycosylation sites. 

til 



amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTGCAAGC 
AACAGTAATGCCAATTGT GAAT TTTCATT GAAAAACATCAGTGACATTCATCCAGAATCCTT CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCAT GGCGAGT GGCT GGAGAGCATC TAGT T T CCACTT CGAT TCTGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
TG TAG T T TAATGGGAATAAAAT GTAAGT AT CAGTAGT TT GAAAAAAAAAAA 
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FIGURE 48 



MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTiyiASGWRAjSSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 



Q N-myristoylation sites. 

Ill amino acids 22-28, 54-60, 83-89, 97-103, 216-222 



Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 



TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAG T GAGGAAC CT GCAGAGTCACACAGT TGC T GACCAAT T GAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CAT TTGCCAATAAAGACGATCCC TTC T ACTATGACT GGAAAAACC TGCAGC T GAGCGGACT GATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
C AGC C AGAAG CAGC ACAG T C C T G T AC C T GAGAAGGC C AT C CC AC T CAT CAC T CC AGGC T C T G CCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 

MERVTLALLLLAGLTALEAOT 
SQKQHSPVPEKAIPLITPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 



amino acids 36-59 



1=1 Transmembrane domain: 

o. 

W ' 

HI N-myristoylation sites. 

tO amino acids 41-47, 45-51, 84-90 

s 

O 

m 
o 
w 
a 
in 



Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE 51 

GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCCT GG TGGGAGGAAGACAC T C T GGAGAGAG AGGGGGC TGGGCAGAGATGAAGT TCCAG 
GGGCCCCTGGCCTGCCTCCTGGTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
f* k GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
% AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
m TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCeGCGGCTCCTGGCAGGGG 
jh| GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
III CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
HS GAAACT CAGCAGGCAGC T T T GGAATGAATCC TCAGGGAGCT CCCT GGGGT CAAGGAGGCAAT GGA 

GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
pa AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
U\ ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
O AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
W' TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
Jrf CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
C T TCAGAGGACAGGGAGT T T CCAGCAACAT GAGGGAAATAAGCAAAGAGGGCAAT CGCCTCCT TG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAAT T T TAAATCCAAGC TGGG T T TCAT CAAC TGGGAT GCCATAAACAAGGACCAGAGAAGC TC TC 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^^ 
iU^AAAAAAAAAAAAA?\AAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 



MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGES GIQGFRGQGVSSNMRE I SKEGNRLLGGS GDNYRGQGSSWGS GGGDAVGGVNTVNSET S PGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

Signal peptide: 

amino acids 1-21 

N-glycosylation site. 

amino acids 265-2 69 

Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATOTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTGCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
, . TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
|=| AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
O GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 
CP CCT CAGAGGGCAGCAGTCGTCT GGACAT GT TT GAGCACAT CAGCCT CATGACCT TGGACAGT CTA 

s „ f. 

;jf CAGAAAT GCAT C TT CAGCT T TGACAGCCAT TGTCAGGAGAGGCC CAGT GAATATATTGCCACCAT 

JJi CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 

jj: ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 

5 GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 

O AGCCAAGT CCAAGACT TT GGAT T TCATT GATGTGCT T CTGCTGAGCAAGGAT GAAGAT GGGAAGG 

CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
Q ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
f|j AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 

TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 54 

MSLLSLPWLGLRPVAMSPWLLLLLVVGSWLLAM 

LITPTEEGLKDSTQMSATYSQGFTVWLGPI IPFIVLCHPDTIRS ITNASAAIAPKDNLFIRFLKP 

WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 

SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 

RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 

MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDIiAQLPFLTMCVKESLR 

LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 

l^FIPFSAGPRNCIGQAFAMAEMKWLALMLLHFRFLPDHTEPRRKLELIMRAE 

VGLQ 

Important features : 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE fig 

ATCGCATCAATT GGGAGTACCATCT T CCT CATGGGACCAGT GAAACAGCT GAAGCGAATGT T TGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT T CAT GGCCAGT T TTATGAAGCT TT GGAAGGCACTAT GGACAGAAGCT GG T GGACAGTT TT 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGAT T CGAACAT T TGAGGGTTAC T T T TGGAAGCAACAATACAT T CT CGAACC TGAAT GT CAG T A 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TCTGGATGTTGTCCCACTGAATTCCCATGAATACAAACCTATTCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
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FIGURE 56 



MGPVKQLKRMFEPTRLIATIMVLLCFALTLCSAFWWHNKGIALIFCILQSLALTWYSLSFIPFAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen -binding domain protein. 

amino acids 30-72 
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FIGURE S7 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TAC GT GCC GGT CATCG GGGAAGC CCAGACC GAGTT CCAGT ACTT T GAGTC GAAGGGGCTC C CT GC CGAGCT GAAGTCC 
AT TTTCAAGCT CAGT GT CTT CAT CCCCT CC CAG GAATT CT C CACCTACCGC CAGT GGAAGCAGAAAATT GTACAAGCT 
GGAGATAAGGACCTT GAT GGGCAGCTAGACTTT GAAGAATTTGT CCATTAT CT C CAAGAT CAT GAGAAGAAGCT GAGG 
CTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATGCAGTCCCTGCGGGACTTG 
GGAGT CAAGATAT CTGAACAGCAGGCAGAAAAAATT CT CAAGAGCATGGATAAAAACGGCACGAT GACCAT CGACT GG 
AAC GAGT GGAGAGACTACCACCT CCT C CACCCCGT GGAAAACAT C CCCGAGAT CAT CC T CTACT GGAAGCATT C CACG 
AT CTTT GATGT GGGT GAGAATCTAACGGT CC CGGAT GAGT T CACAGT GGAGGAGAG GCAGACGGGGAT GT GGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGT CCAT GCCT CCCGCAG CAACAACAT GGGCAT C GTTGGTGGCTTCACT CAGAT GAT T C GAGAAGGAGGGGCCAGG 
TCACTCTGGCGGGGCAATGGCATC^ACGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATTCATGGCCTATGAGCAG 
%^ AT CAAGCGCCTTGTTGGTAGT GACCAGGAGACT CTGAGGATTCACGAGAG GCTT GT GG CAGGGT C CTT GGCAGGG GCC 

l s , ATCGC CCAGAGCAGCATCT ACCCAAT GGAGGT CCT GAAGAC CC GGATGGC GCT GCGGAAGACAGGCCAGTACT CAGGA 

W AT GCT GGACT G CGCCAGGAGGAT CCT GGC CAGAGAG GGGGT GGCCGCCTT CTACAAAGGCT AT GT CCCCAACAT GCTG 

#J GGCATCATCCCCTATGCCGGCATCGACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTG 
]Z AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
■V" C CCCT GGCCCTAGT CAGGAC CCGGAT GCAGG CGCAAGCCTCTATT GAG GGC GCT CCGGAGGT GAC CAT GAGCAGCCT C 

l|J TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 

G CT GT GAGCAT CAGCTACGT GGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGG TGA CGGGGGGAGGGC 
IMI CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
f|! AAGCT GTCT CGAGCCAAGCT GT GAAAACC CTAGACGCACCCGCAGG GAGGGTGGGGAGAGCT GGCAGGC CCAGGGCT T 

% GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
ffi f ,i CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
% TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
jpsfe CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
%A ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 
iff TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 

CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
W GGCCTGGAC CCT GT CAGGAT GGGC CCC AC CT CAGAACCAAACTCACTGT CCC CACT GT G GCAT GAGGGCAGT GGAGCA 

|y C CAT GT T T GAGGG CGAAGGGCAGAGCGT TT GT GTGTT CT GGGGAGGGAAGGAAAAGGT GTT G GAGG CCTTAATTATGG 

f£L ACT GTT GG GAAAAGGGT TT TGT CCAGAAGGACAAGCCGGACAAATGAGCGACTT CT GT GCT T CCAGAGGAAGACGAGG 

W GAGCAGGAGCTT GGCT GACTGCT CAGAGT CT GTT CT GAC GC CCTGGGGGTT CCT GT CCAACC CCAGCAGGGGCGCAGC 

f|j GGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGTTATGTCCT 

AACTATTT T TATAGAT TT GTTT AATT AATAGCT T GT CAT TTT CAAGT T CATTTTTTATT CATAT TTAT GTT CAT GGTT 
GATT GTAC CTT CC CAAGCC CGC CCAGT GG GAT G GGAG GAGGAGGAGAAGGGGGGCCTTGGGC CGCT GCAGT CACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GT T GGGGAAGGGCT TGCC CCCAGCCT TAGGATTT CAGGGTTT GACT GGGGGCGT GGAGAGAGAGGGAGGAAC CT CAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CT T CTGCTGCCCTT GCTTAACAAT GCC G GCCAACT GGCGAC CT CAC GGT T GCACTT C CATT CCACCAGAAT GACCT GA 
T GAGGAAAT CTT CAATAGGAT GCAAAGAT CAAT GCAAAAATTGT TATATAT GAACATATAACT GGAGT CGT CAAAAAG 
CAAATTAAGAAAGAAT T GGACGTTAGAAGTT GT CATT TAAAGCAGC CTTCTAATAAAGTTGTTT CAAAG CT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 58 



hi 

a 

?f 2 



MLCLCLYVPVI GEAQTE FQYFESKGLPAELKS I FKLSVFI PSQEFS T YRQWKQKI VQAGDKDLDG 
QLDFEE FVHYLQDHEKKLRLVFKI LDKKNDGR I DAQE IMQS LRDLGVKI SEQQAEK I LKSMDKNG 
TMT I DWNE WRDYHLLHPVENI PE I I LYWKHST I FDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVmQVHASRSl^GIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQI KRLVGSDQE TLR IHERLVAGS LAGAI AQS S I Y PME VLKT RMALRKT GQYS GMLDCARR 
I LAREGVAAFYKG YVPNMLGI I PYAG I DLAVYE TLKNAWLQHYAVNSADPGVFVLLACGTMS S TC 
GQLAS YPLALVRTRMQAQAS I EGAPEVTMSS L FKHI LRTEGAFGLYRGLAPNFMKVI PAVSISYV 
VYENLKI TLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 



Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACAT TGGGGAGGAT GGAATCCTGAGCTGCACT T T T GAACC TGACAT CAAACT T T CT GATAT 
CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 
AGC TGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGT G T T T GCT GATCAAGT GATAGTT 
M ; GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
O CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
|J AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
Tl CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
lf| CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
fij T CAACAACACATACT CCT GTAT GATTGAAAATGACAT TGCCAAAGCAACAGGGGATAT CAAAGTG 

«K ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 

CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 

Jfjj GTGCCT TGGCCACAAAAAAGCAT GCAAAGT CAT T GT TACAACAGGGAT CTACAGAACTAT T T CAC 

Q CACCAGATATGACCTAGTTTTATATTTCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTG 
||I AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCT CCAATAT GAACAAGATAAAT 

D CTATCTTCAAAGACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGA 
5 11 GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 

CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
T C T TCCCAAC TGACAAAT GCCAAAGT T GAGAAAAATGAT CATAAT T TTAGCATAAACAGAGCAGT 
CGGGGACACCGATTTTATAAATAARCTGAGCACCTTCTTTTTAAACAAAAAA7VAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
Y 1 1 T S KGKGNANLE YKTGAFSMPEVNVDYNAS SET LRCE APRW FPQP T WWAS QVDQGANFS E VS 
NTSFELNSENVTMKWSVLYWTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVSS FFAI SWALLPLSPYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 

T GACGTCAGAAT CACCATGGCCAGC TAT CCT TACCGGCAGGGC TGCC CAGGAGCT GCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
%A GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
G AT TGGT CT TCAT TCAATGAT GAGACC TGCCTCATGAT GATAAACATGT T T GACAAGACCAAGTCA 

y GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
~?l CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
|f| AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
III GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
4l GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
JL ' CCATGACAGCTTCTCGGATGCT ATGAC CCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 

T CCTGGCT T C T TAGAG TGAGAGAAG TAT GT GGACATCT C T T CT T T TCCTGTCCC T C TAGAAGAAC 
Q ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
|d AATAGTGAGGACC GGGGCT GAGGCCACACAGATAGGGGCCT GAT GGAGGAGAGGATAGAAGT T GA 

C3 ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 

fll 

AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
C T GACCAAGCATGAGAGAGATCT GTC TAT GGGACCAGT GGCT TGGAT TC TGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATTTGGGGCCAAAAGTCCAGTGAAATTGTAAGCTTCAATAAAAGGATGAAACTCTGA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETCLMMINMFDKTKSGRIDWGFSALWKFIQQWKNLF 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 

DTAVQGNIRLS FEDFVTMTASRML 

| s j ; Important features of the protein: 

O Signal peptide: 

7;;; amino acids 1-19 

|f| N-glycosylation site. 

ffj amino acids 147-150 

r 

85SS! 

^ Casein kinase II phosphorylation sites. 

J>J amino acids 135-138, 150-153, 202-205, 271-274 

l|j N-myristoylation sites. 

13 amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 

111 66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 

CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCAJCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAAT CAACCCCCAGGTACCAGC TGGGGAAATAT TAATCGGTATCCAGGAGG 
CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGGCCCTTGCTG 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
TGCTCATCTCCAATAAAATAAAAGCACTTATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAi^^ 
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FIGURE 64 

MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWP 

LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGYLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6s 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKES FLTNS S YES S FLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 



N-glycosYlation site. 

amino acids 37-41 



Important features: 
Signal peptide: 

amino acids 1-19 

M 
G 
Q 

m 

w . 

fjf N-myristoylation sites. 

(0 amino acids 15-21, 19-25, 60-6* 

O 
111 
C 
M 

a 
ru 
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FIGURE 67 

ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGGCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGAC T CT GCCCC TGAGGT CAAGAGAGGAT GGGGC TAT T CAC T T T TAT AT AT T TATATAAAAT TAG 
TAGTGAGATGTAAAAAAAAAAAAAAAAAA 




i y 
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FIGURE 68 

MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 



* 



Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 



N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6Q 

GCCAGGAATAACTAGAGAGGAACAATGGGGTTATT CAGAGGT T TTGTTTT CCT CTT AGTTCT GTGCCTGCT GCAC CAG 
T CAAATACTT C CT TCATT AAGCT GAATAATAAT GGCT T TGAAGATATT GT CATTGTTATAGAT CCTAGT GT GC CAGAA 
GAT GAAAAAATAATT GAACAAATAGAGGATAT GGT GACTACAGCTT CTACGT ACCT GTTTGAAGC CACAGAAAAAAGA 
TTTTTTTTCAAAAATGTATCTATATTAATTCCTGAGAATTGGAAGGAAAATCCTCAGTACT^AAAGGCCAAAACATGAA 
AAC CATAAACAT GCT GAT GTTATAGTT GCACCACCTACACT CCCAGGTAGAGAT GAACCATACAC CAAGCAGTT CACA 
GAATGT GGAGAGAAAGG CGAATACAT T C ACTT CACC C CTGACCTT CT ACT T GGAAAAAAACAAAAT GAAT AT GGACCA 
CCAGGCAAACT GTTT GTCCAT GAGT GGGCTCACCT C CGGT GGGGAGTGTTT GAT GAGTACAAT GAAGAT CAG CCTTT C 
T ACCGT GCTAAGT CAAAAAAAAT CGAAG CAACAAGGT GTT CCG CAGGTAT CTCT GGT AGAAAT AGAGTTTATAAGT GT 
CAAGGAGG CAGCT GT CT TAGT AGAGCAT GC AGAATT GATT CT ACAACAAAACTGTAT G GAAAAGATT GT CAATTCT T T 
C CT GATAAAGTAC AAACAGAAAAAGCAT CCATAAT GTT TAT GCAAAGTAT T GATT CT GT T GTT GAATT TT GTAACGAA 
AAAACC CATAAT CAAGAAGCT CCAAGCCTACAAAACAT AAAGT GCAAT T T T AGAAGTACAT GGGAGGT GATTAG CAAT 
IU, TCTGAGGATTTTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATCAGTCAA 
L:, AGAATT GT GT GCTTAGTT CTT GATAAGT CT GGAAGCAT GGG GGGTAAGGACCGC CTAAAT CGAATGAAT CAAGCAGCA 

*«J AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGGTGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTA 
P AATAAGCT AAT CCAAATAAAAAGCAGTGAT GAAAGAAACACACT CATGGCAGGAT T AC CTACATAT CCT C T GG GAGGA 

»~l ACT T CCAT CT GCT CT GGAATTAAATATG CATTT CAGGTGATT GGAGAGCT ACATTCCCAACTCGAT GGAT CCGAAGTA 

V s CTGCT GCT GACTGATGGGGAGGATAACACT GCAAGTTCTT GT AT T GAT GAAGT GAAACAAAGT G GGGCCATT GT TCAT 

|.|| TTTATT GCT TT GGGAAGAG CT GCT GATGAAG CAGTAAT AGAGAT GAGCAAGATAACAG GAGGAAGT CATTTTTAT GTT 

5S9 T CAGAT GAAGCTCAGAACAAT GG CCT CATT GAT G CT TTT GGGGCT CTTACATCAGGAAATACT GAT CT CT CCCAGAAG 

^« T CCCT T CAGCT CGAAAGTAAGGGATTAACACTGAATAGTAAT GCCT GGAT GAACGACACT GTCATAAT T GATAGTACA 

fjj GT GGGAAAGGACACGT T CT T T CT CAT CACAT GGAACAGT CT GC CT CC CAGT AT TTCTCT CT GGGAT CCCAGTGGAACA 

"p ATAAT G GAAAATTTCACAGT GGAT GCAACTT CCAAAAT GGC CT ATCTCAGTATT CCAGGAACT GCAAAGGT GGG CACT 

T GGGCATACAAT CTT CAAGCCAAAGC GAAC CCAGAAACATTAACT ATTACAGTAACTTCT CGAG CAGCAAATT CTT CT 
» GTGC CT CCAAT CACAGTGAAT GCTAAAAT GAATAAG GAC GTAAACAGT TTC CCCAGCC CAAT GAT T GTTTACGCAGAA 

ATT CTACAAGGATAT GTAC CT GTT CT TG GAGC CAAT GT GACTG CTTT CATT GAAT CACAGAAT GGACATACAGAAGTT 
W TTGGAACTTT T G GAT AAT GGT GCAGGCG CTGAT T CTTT CAAGAAT GAT GGAGT CTACT CCAGGTATTT TACAGCAT AT 

Ifl ACAGAAAAT GGCAGATAT AGCTT AAAAGT T CGGGCT CAT GGAGGAG CAAACACT GCCAGGCTAAAATTACG GCCT C CA 

f»i CTGAAT AGAGCCGCGTACATACCAGGCT GGGTAGTGAAC GGGGAAAT T GAAGCAAAC CCGC CAAGAC CTGAAAT TGAT 

;H; GAGGATACT CAGACCAC CTT GGAGGATT T CAGCC GAACAGCAT CCGGAGGTGCATTT GT GGTAT CAC AAGT CCCAAGC 

lij CTT C CCTT GC CTGAC CAATACCCACCAAGT CAAAT CACAGAC CT TGAT GCCACAGTT CATGAGGATAAGAT TATTCTT 

#4 ACAT GGACAGCACCAGGAGATAAT TTTGATGTT GGAAAAGTT CAACGT TAT AT C ATAAGAATAAGT GCAAGTATT CT T 

y GAT CTAAGAGACAGTT TT GATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGAGGCCAACTCCAAGGAA 

f|| AGCT TT GCATT TAAACCAGAAAATAT CT CAGAAGAAAAT GCAACCCACAT ATTTAT T GC CATTAAAAGTAT AGATAAA 

AGCAATTTGACATC^VAAAGTATCCAACATTGCACAAGTAACTTTGTTTATCCCTCAAGCAAATCCTGATGA 
CCTACACCT ACT C CTACT C CTACT CCTACTCCT GATAAAAGTCATAAT T CT GGAGTTAATATT T CTACGCT GGT ATT G 
T CT GTGATTGGGT CTGTT GTAATTGTTAACTTTATTT TAAGTACCAC CATT TGAACCTTAAC GAAGAAAAAAAT CTT C 
AAGT AGAC CTAGAAGAGAGTT TTAAAAAACAAAACAAT GTAAGTAAAGGATATTTCT GAAT CT TAAAATTC AT C CCAT 
GT GT GAT C^T AAACTC7VTAAAAATAATTT TAAGATGT CG GAAAAGGATACT TTGATTAAATAAAAACACTCAT GGAT A 
T GT AAAAACTGT CAAGAT TAAAAT TTAATAGT TT CAT TTATTT GTTAT TTTATT TGTAAGAAATAGT GATGAACAAAG 
ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCIAACAGTTTTCTGAAATGATATTTCAAATTGCATCAA 
GAAAT TAAAAT CAT CTAT CTGAGTAGT CAAAATACAAGT AAAGGAGAGCAAATAAACAACATT T GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIYAPPTLPGRDEPYTKQFTECGEKGEY 
I HFT PDLLLGKKQNE YGP PGKL FVHE WAHLRWGVFDE YNEDQP FYRAKS KKI EATRC SAG I S GRN 
RVYKCQGGS CL S RACR I DS T TKLYGKDCQFFPDKVQTEKAS IMFMQS I DS WE FCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
ijL YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
p SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
Q I TWNSLPPS I SLWDP SGT IMENFTVDATSKMAYLS I PGTAKVGTWAYNLQAKANPETLT I TVTSR 

^ AANSSVPPI TVNAKMNKDVNS FPSPMI VYAE I LQGYVPVLGANVTAFI ESQNGHTEVLELLDNGA 

GADS FKNDGVYS RY FTAYTENGRYS LKVRAHGGANTARLKLRPPLNRAAY I PGWWNGE I EANP P 
fq RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
-C* FDVGKVQRYIIRISASILDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAI 
* m KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
*J VNFILSTTI 

m 

Q 

| g | Signal peptide: 

Q amino acids 1-21 

iU 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT CCT TAGGT GGAAACC CTGGGAGTAGAGTACT GACAGCAAAGAC C GGGAAAGACCATAC GT CCC CGGGCAGG GGTGA 
CAACAGGT GT CAT CTTT TTGAT CTCGT GTGT GGCT GCCTT C CTATTT CAAGGAAAGAC GC CAAGGTAATT TTGAC CCA 
GAGGAGCAAT GAT GTAGCCACCT CCTAACCTT CCCT T CTT GAACC C CCAGT T ATGCCAGGATTTACTAGAGAGTGT CA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCTGTGATGGGAAT GAAGGT GAAAACTT G GAGATT T CACTT CAGT CATT GCTTCTGC CT GCAAGAT CAT CCTTT AAA 
AGTAGAGAAGCT GCTCT GTGT G GTGGT TAACTCCAAGAGGC AGAACT CGT T CTAGAAGGAAAT GGATGCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTT GCT GGT GCTC CT CTGCT GTGCTATCT CT GT C CT GTACATGT T GGCCT GCACC C CAAAAGGT GACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
CGCAACT ACGT GAGCAGCCT GAAGC GGCAGAT CGCACAGCT CAAG GAGGAGCT GCAGGAGAGGAGT GAGCAGCTCAGG 
AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
CT GGCCTT CCTGCACTC GCAGGTGGACAAGGCAGAGGT GAAT GCT GGCGT CAAGCT GGCCAC AGAGTATGCAGCAGT G 
C CTTT CGATAG CTTTACT CT ACAGAAGGTGT ACCAGCT GGAGACT GGC CT TACCCGCCAC CCCGAGGAGAAGCCT GT G 
AGGAAGGACAAGCGGGAT GAGT T GGT GGAAGC CAT T GAAT CAGCCT T GGAGACCCT GAACAAT CCT GCAGAGAACAGC 
CCCAAT CACC GT C CT TACACGGC CT CT GAT TT CATAGAAGG GATCTAC CGAACAGAAAGGGACAAAGGGACATTGT AT 
GAGCTCACCTTCAAAGGGGACCACAAAC^CGAATTCAAACGGCTCATCTTATTTCGACC^TTCAGCCCCATCATGAAA 
GT GAAAAATGAAAAGCT CAACAT GGC CAACACG CTTAT CAAT GTTAT CGT G CCT CTAGCAAAAAGGGT GGACAAGTT C 
CGGCAGTTCATGCAGAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGG 
AAAGAAGAAATAAAT GAAGT CAAAGGAATACTT GAAAACACT T CCAAAGCT GCCAACTTCAGGAACTTTACCTTCATC 
CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TTTTTCTGTGATGTGGACATCTACTTCACATCTGAATTCCTCAATACGTGTAGGCTGAATACACAGCCAGGGAAGAAG 
GTATTTTATCC^GTTCTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAA 
CAGCAGCT GGT CATAAAGAAGGAAACTGGAT TTTGGAGAGACT TT GGATTT GGGAT GACGT GTCAGTATCG GT CAGAC 
TTCATCAATATAGGTGGGTTTGATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTC 
CACAGCAACCT CATAGT GGTACGGACGC CT GTGCGAGGACT CTTCCACCT C T GGGAT GAGAAGCG CTGCAT GGACGAG 
CTGACCCCCGAGCAGTACAAGATGTGCATGCAGTCCAAGGCCATGAACGAGGCATCCCACGGCCAGCTGGGCATGCTG 
GT GTT CAGGCAC GAGATAGAGGCTCACCTT CGCAAAC^GAAACAGAAGACAAGTAGCAAAAAAACATGAACTC CCAGA 
GAAGGAT T GT GGGAGACACTTTTT CTTT CCT TT T GCAATTACT GAAAGT GGCTGCAACAGAGAAAAGACTT CCATAAA 
GGACGACAAAAGAATT GGACT GAT GGGT CAGAGAT GAGAAAGCCTCCGATT T CT CT CT GTT GGGCTTTTTACAACAGA 
AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 
AGATTATAAGCCTAATGGTGT GGAGGT TTT GAT GGT GT TTACAATACACT GAGACCTGTT GTT TT GTGT GCTCATT GA 
AATATT CAT GATTTAAGAGCAGTTTT GTAAAAAATT CATT AGCAT GAAAGGCAAGCATATT TCTCCTCATATGAAT GA 
GC CT AT CAG CAG GGCTC TAGTTT CTAGGAAT GCTAAAATAT CAGAAGGCAGGAGAGGAGAT AGGCTT ATTATGATACT 
AGTGAGTACATTAAGTAAAATAAAATGGACCAGAAAAGAAAAGAAACCATAAATATCGTGTCATATTTTCCCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
T TT TTT CC CT T GT GAGT TATAGT CTGCTTATTTAATTACCACTTT GCAAG CCTTACAAGAGAGCACAAGTT GGCCTAC 
ATT TTTATAT TTTTTAAGAAGATACTTT GAGAT GCAT TAT GAGAACTTTCAGTT CAAAGCATCAAATT GAT GC CATAT 
CCAAGGACAT GCCAAAT GCT GATTCT GT CAGGCACT GAAT GTCAGGCATT GAGACATAGG GAAGGAAT GGTTT GTACT 
AATACAGACGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAATGAC 
ACT TT CT GCT TTACAGAAAAGGAAACTCATT CAGACT GGT GATAT CGTGAT GTACCTAAAAGTCAGAAAC CACATT TT 
CTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGTGTGAACCAAACAATCTCTTTTC 
AAAACAGGGT GCT CCT CCT GGCTTCTGGCTTCCATAAGAAGAAAT GGAGAAAAATATATATAT ATATATATATATT GT 
GAAAGATCAATCC^TCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 
TAACT GAATTATTTTTTAAATTAAGCAGTT CTACT CAATCACCAAGAT GCT TCTGAAAATT GCATT TTAT TACCAT TT 
CAAACTATTT TTTAAAAATAAATACAGT TAACATAGAGTG GTTTCTTCATT CAT GT GAAAATTATTAGC CAGC AC CAG 
AT GCAT GAGCTAATTAT CTCTTT GAGTCCTT GCTT CT GTT T GCT CACAGT AAACT CAT T GTTTAAAAGCTTCAAGAAC 
ATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACACAGG 
CCATGAATGGAAGGTGGTATTGCACAGCTAATAAAATATGATTTGTGGATATGAA 
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FIGURE 72 

MMMVRRGLLAW I S RVWLLVLLCCAI SVLYMLACT PKGDEEQLALPRANS PT GKEGYQAVLQEWE 
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATE YAA.VP FDS FTLQKVYQLETGL TRHPEEKPVRKDKRDE LVEAI E SALE TLNNPA 
ENSPNHRPYTAS DFI EG I YRTERDKGTLYELT FKGDHKHE FKRL I L FRPFS P IMKVKNEKLNMAN 
TLINVIVPIAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVKGILENTSKAANFRNF 
T FI QLNGE FSRGKGLDVGARFWKGSNVLL FFCDVDI YFT S E FLNTCRLNTQPGKKVFYPVL FSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIVVRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL 
RKQKQKTSSKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 

GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAAT GCCCAG TC GCCCAT CGAT AT TCAGACAGACAGT GT GACATT T GACCCT GAT T TGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
TGAAGCCACATTTGCAGAGCTCCACATTGTACATTATGACTCTGATTCCTATGACAGCTTGAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGCTTATGAACACATTCTGAGTCACTTGCATGAAGTCAGGCATAAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCGCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GGTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
AT T T CAAT GGAACAGC TGGAAAAGCT T CAGGGGACAT TGT T CT CCACAGAAGAGGAGCCCT CTAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGTC T TCACCT CAGCACAAGCCACGACT GAGGCATAAATT CC T TCT CAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACT GTAGGAGTAG TAAGCAGAT GT CCTCC T TCCGCT GGACAT C T CT TAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CT CC T TAGGATAAAGAGT T GCT GT T GAAGT TGTATAT T T T T GAT CAAT AT AT T TGGAAATTAAAG 
TTTCTGACTTT 
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FIGURE 74 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFT SAQAT TEA 

Important features of* the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 7fi 



TGCCGCTGC.CGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 
ATGGACTCCACCAGAGGGTCTT CAAGGAGTTAAAGT TACT TACACTGTGCAGTAT TTCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCT TCCTGT TTCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 
GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
ACTT TATCACCCTCAATATCTCGGATGAT TCTAAAAT TTC TCAT CAGGATATGAGTT TACTGGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGTGAAACATT TAGGGTATGC T TC GCAT T T GATGGAAAT T T T T TGTGAC TCT GAAG 
AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 
ACAGTCATTGAATATGAATATGATGTCAGAACCACTGACATTTGTGCGGGGCCTGAAGAGCAGGA 
GCTCAGTTTGCAGGAGGAGGTGTCCACACAAGGAACATTATTGGAGTCGCAGGCAGCGTTGGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
C CAGACAGGCCACCAGGAGAAAAT GAAACC TAT C TCATGCAAT T CAT GGAGGAATGGGGGT TATA 
TGTGCAGATGGAAAAC TGA TGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
C TGTGAGAAT TACT TATT T CTT T T CT C TAT TC T CATAGCACGTGTGT GAT T GGT TCAT GCAT GTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 



MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQS SE FKAKI I FWYVLP I S I TVFL FS VMG YS I YRY I HVGKEKHPANLI LI YGNE FDKRFFVTAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAA.GCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGT GGAGTT CCACATGACGACTGAGGCCCAAGCCACCATCCGCAT GGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
C CAAT CT AG T GAAAAACCAGC TGTGTCCCGT GAT C GAGGC T T C C T T C AAT GG CAT GTAT GCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
TGATGAAT GAAAAGGCTGCAGATAAGCTGGGATCTACCCAGATCGTGAAGATCC TAAC TCAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGGCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
T TTACACCAAAGGTGACCAACT TAT ACT CAAC T T GAATAACAT CAGC T CT GATCGGAT CCAGCT G 
AT GAACT C T GGGAT T GGC TGGT T CCAACCT GATGTT C TGAAAAACAT CAT CAC T GAGATCAT CCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWT FT LLCGLLAATL I QAT LS PT AVL I LGPKVI KEKLTQELKDHNATS I LQQLPLL SAMREK 
PAGGI PVLGS LVNTVLKH I IWLKVI TANI LQLQVKPSANDQELLVKI PLDMVAGFNT PLVKT I VE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL 
VKNQLCPVIEAS FNGM YADLLQLVKVP ISLSI DRLE FDLLYPAI KGDT I QL YLGAKLLDS QGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDWKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQI VKILTQDTPE FFI DQGHAKVAQLI VLEVFPSSEALRPLFTLGI EAS SEAQFYT 
KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSILLPNQNGKLRSGVPVSLVKALG 
FEAAE SSL TKDALVL T PAS LWKP S S PVS Q 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 



GAGAGAAGTCAGCC T GGCAGAGAGACTCT GAAATGAGGGAT TAGAGG T GT T CAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
C C AAGGG C C T C TGGAT GGAAT G T GC CACAC ACAGCAC AGG CAT CAC C CAG T GT GACAT C T ATAG C 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
| s i CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
D GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
M ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
f!- CAT GAAAT T TGAGAT TGGAGAGGC T CT T TACT TGGGCAT TAT T T CT T CCCTGTTC TCC CTGATAG 

iy 

iffi CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
111 . CAAGCCCAACCTCT T GCCACAAGGAGCT CT CCAAGGCCTGGTCAAC C T CCCAAAGTCAAGAGTGA 

-IK GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
^ GGT C T GT GAAAAACAG TGGACAGCAC CCCGAGGGCCACAGGTGAGGGACAC TAC CAC T GGAT CGT 

Sjj GTCAGAAGGT GC TGCTGAGGAT AGACTGACT TT GGCCAT TGGAT TGAGCAAAGGCAGAAAT GGGG 

pi GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
I?! TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
O GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 

TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACT AAT T TGTGCATGAAC TGAAATAAAACCATCC TACGGTAT C CAGGGAACAGAAAGCAGGAT G 
CAGGATGGGAGGACAGGAAGGCAGCCTGGGACATTTAAAAAAATA 
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FIGURE 80 



MAS LGLQLVG YI LGLLGLLGTLVAMLLPS WKTS S YVGAS I VTAVG FSKGLWMECATHS TG I TQCD 
I YS TLLGLPAD I QAAQAMMVT S S AI S SLAC 1 1 S WGMRC TVFCQE S RAKDRVAVAGGVFF I LGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRS S PRPGQPPKVKSE FNS YSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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FIGURE 81 



, CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTGCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGG ATG GTTCCCGAGGTGRGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
1^ CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
It GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
yl TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTAGCAACACGGAG 
HI AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
PJ ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
4 s ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
jjj AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
Q CAGACCCAAGGGAGCAGGCAGCACAACT GT CAAGATCGT CC T GAAGGAGAAACATAAGAAAGCCT 

III GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCT TCCACT TGACTCAGATCAAGAAAGTCAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACAT AACAAAGACCTAA.CAGT TGCAGATATGAGCT GTATAATT GT T GT TAT TATATAT TAATAAA 
TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVS CYRLHC PPVHC PQPVTE PQQCCPKCVE PHTP S GLRAPPKS CQHNGTMYQHGE I FSAHEL F 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 



App_ID=10063584 



Page 232 



FIGURE 83 



fit, 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
C C AGTT CTCT AGCGTGGT CC ACCT CT AC AGGGAC GGGAAG GACC AGCCAT TT AT GCAG AT GCC ACAGT AT C 
AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
AC AG GT GTC AGC AC TGGGCT C AGT T C CT CT C ATT TC CAT C ACGGGAT ATGTTGAT AGAGACAT CC AGCT AC 
T C T GTC AGT C CT CGGGCT GGT T CCCC CGGC CC ACAGCGAAGT GGAAAGGT CCAC AAG GAC AG G ATT TGT C C 
AC AG AC TCCAGGAC AAAC AG AGACAT GC AT GGCCTGTT T GAT GT GGAGAT CTCT CT GACCGT CCAAGAGAA 
CGC C GG GAGC AT ATC CTGT TCC AT GC GGCAT GCT CA*TCT G AG CC GAGAGG TGGAAT C CAGGGTACAGATAG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
T TT T TT GGCATT GTTG GACT GAAGAT TTT CT T CT CC AAATTC CAGT GG AAAAT C CAGGCGGAACTGGACT G 
GAGAAGAAAGCACGGACAGGCAGAATTG AGAGAC GCCCGG AAACACGC AGTGGAGGT G AC TCTGGATCCAG 
AGACGG CTCACC CG AAGCTC TGCGTT TCTG AT CT GAAAACT G T AACCC AT AGAAAAGCT CCCCAGGAGGT G 
CCTC AC TCT GAG AAGAGATT TACAAGGAAGAGT G TGGT GGCT T CTC AG AGTT TCCAAG CAGGGAAACATT A 
CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
AGGAGT ACGT GACT T T GT CTCCCGAT C ATGGGT ACT GGGT CC T C AGACTGAATGGAGAACAT T TGT AT TT C 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCT TATT GAGGCC CT AC AT TGAGT ATCCGT C CT AT AAT GAG CAAAAT GGAACT CCCATAGTCATCTGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACAT 
TCTT CT TTAGGG AT AT TAAGGT CTCT CT CC CAGATC C AAAGT CC CGCAGC AGCC GGCCAAGGT GGCT T CCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CTGACATTAC ATTT AGTT TGCT CTCACT CC AT C T GGCT AAGT GATCTT GAAAT ACC ACC T CT CAGGTGAAG 
AACCGT CAGGAAT T CC CATCTC ACAGGCTGTGGT GT AG ATT AAGT AGACAAGGAAT GTGAAT AAT GCT TAG 
ATCT T ATTGATG AC AGAGTGTATCCT AATGGT T T GT TC ATT AT ATT AC AC TTTC AGT AAAAAAA 
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FIGURE 8 J. 

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSVVH 
LYRDGKDQPFMQMPQYQGRTKLVKDS IAEGRI SLRLENI TVLDAGLYGCRI SSQSYYQKAI WELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 85 



AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAAT TGCACCCT GAGCATCAGAGAT GCCAGAAGAAGTGATGCGGGGAGA 
TACTTCT T TCGTATGGAGAAAGGAAGTATAAAATGGAAT TATAAACATCACCGGCTCTCTGTGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
\n GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 



HI ACAAGACC GT CCAT C T CAACG T GT CC TACCCGCC TCAGAAC TT GACCAT GACTGTC T TCCAAGGA 

-jS GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
* w GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 

GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
g*% CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
|y GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 

O CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 

ft % 

-.^ TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCC T T GGGAC TC GCGGGGACAGGAGGCCAC TGACACCGAGTAC TC GGAGATCAAGAT CCACAG 
ATGA GAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGCTGATTCT TGTAGAAT TAACAGCCCTCAACGTGATGAGCTATGATAACACTATGAATTAT G 
T GCAGAGT GAAAAGCACACAGGCT T TAGAG TCAAAGTAT C TCAAACCT GAAT CCACACT GTGCC C 
T CCCT TT TATTT TT TTAACTAAAAGACAGACAAATTCCTA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS IRDARRSDAGRYFFRMEKGS IKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFWVRSCRKKSARPAAGVGDTGIE 

|»* DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 

O YSEIKIHR 



| g = Signal peptide: 

||| amino acids 1-15 

r 

Transmembrane domain: 

1~ amino acids 351-370 



III 
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FIGURE 87 

AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
T CATAGCGACCACCAGAGGATGGAGTACAGATGAGGC TAATACT TAC T TCAAGGAAT GGACC TGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAACTACAACACCTT TGGATCTGCAGAGGCGGCCACGAGCGAT GACTACAAGAACCC TGGC TACT 
ACGACAT CCAGGCCAAGGACC T GGGCATC TGGCACGT GCCCAATAAGTCCCCCATGCAGCAC TGG 
AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
GTTTGGCATCTACCAGAAATATCCAGTGAAATATGGAGAAGGAAAGTGTTGGACTGACAACGGCC 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGCGGGAATTCACTGCGGGAT TTGT TCAGTTCAGGGTAT TTAATAACGAGAGAGCAGCCAA 
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
T TGAGAG T T TTGTGGGAGGGAACCCAGACCT CTCC TCCCAACCAT GAGAT C CCAAGGATGGAGAA 
CAAC T T ACC CAG TAGCTAGAAT GT TAAT GGCAGAAGAGAAAACAATAAATCATAT TGAC TCAAGA 
AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 



SiXt 



amino acids 163-167 



Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 




N-myristoylation sites. 



amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 

CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGGTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
h% TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
O TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
CP GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
ft TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
« TTAAGACT TATATACAGTTTTAGGGGACAAT TAAAAAAAAAAAA 

m 

as 



:: 

m 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVTUjSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 



Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 

u 
Q 
O 
01 

if! 

m 

4 



0 

m 
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FIGURE Qi 

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATGCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCC T CAGCCAAGGTGAAGAAAAT CT T T GGC TGGGGCGAC TT C TACT CCAACAT CAAGAC GGT G 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
CATAGTGATACCCCC TAC TACCCATC T GGGTGACCCGGGGCAGGCCACAGAGGCCAGGC CAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 
AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGGAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCT TAAGC TAAGACAGGACGAT TGTGGTC C TCC CACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALMLLVTGKIVDHGNGTF 
SVHFQHNATGQGNI SIS LVPPSKAVE FHQEQQI FIEAKASKI FNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQS SATWSCSQPFKVVCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 

CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGT TCCGATT TGCATAT TATAAAC T CT T AAAAAAAGCCAGTGAAGGT T T GAAGAGTATAAACC C 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
(3 GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
y ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
** J TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
i-k TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
fsj AAGCC T GAAAC TC T GCC T GCT CT GCCAAGACAAGAAC TT T C TT CT T TACAACCAGCGC TCCAGAT 

=p AACCTCAGGGAACCAGCAC TT CCCAAACC GCAGAC TACAT C TT TAGAGGAAGCACAAC TGTGCCT 

TT T T C TGAAAATCCC T T T T TC T GGT GGAAT T GAGAAAGAAATAAAAC TATGCAGATA 

o 
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FIGURE Qd. 

MTAAVFFGCAFIAFGPALALWFTIAIEPLRIIFLIAGAFFWLVSLLISSLWFMARVIIDNKDG 
PTQKYLLI FGAFVSVYIQEMFRFAYYKLLKKASEGLKS INPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 95 

AAT TT TT CACCAGAGTAAAC TT GAGAAACCAAC TGGACC T TGAGTAT T GTACAT T T TGCCT CGT G 
GACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGATTCTACTGTTTTGTCTTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCCTTCTTTAAGTCTGATACCATTAACA 
CAGAT GC TCACACTGGGGCCAGATC T GCAT CTGT TAAATC CTGC TGCAGGAATGACACC TGG TAC 
C CAGACCCACC CAT TGACCCT GGGAGGGT T GAAT GTACAACAGCAACT GCACCCACAT GTGT TAC 
CAATT TT TGTCACACAACT TGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAAT CCAGATGTCCAGGATGGAAGCCT TCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
G7VACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
G C AG G CAT C CAAAG GAGCAC AC AT G C CAT C GAGGAAGCCAC CACAGAAT CAG CAAAT GGAAT TC A 

GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT T GAT TAT AT TAT GGAAT AGAT T GAG ACACAT T GGAT AG T C T T AGAAGAAAT TAAT T C T TAAT T 
T AC CT GAAAATATTC T TGAAAT T T CAGAAAATAT G TTCTATGTAGAGAAT CCCAACT T T TAAAAA 
CAATAATTCAATGGATAAATCTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGCATAT 
T AAAACATAT T T GGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTS QAGAMPDVQDGS LPAGGAGVNPATQGT PAGRL P T P S G 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 



Signal peptide: 

amino acids 1-16 



m 



Q 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGGCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
s . GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
J* CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
tl ACCAGCCT TGACCAC GCCC TGGAGAGGACC T ACCAGC TGT T GGTACAGGT CAAGGACAT GGGTGA 

13 CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
m CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
fgi CAC T GGAGT GGGGG T GAT GTGCACT AT CACCT GGAGAGC CATCC CCCGGGACCC T T TGAAGT GAA 

fil TGCAGAGGGAAACC TC TACGT GACCAGAGAGC T GGACAGAGAAGCCCAGGC T GAG TACCTGC TCC 

AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
CO ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
42 CAGTCCACCAGG TAC T GAAGTGACTAGAC T GT CAGCAGAGGATGCAGATGCCCC CGGC TCCC CCA 

ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
1~ CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
y CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
If! TCGAAGTCGCAGTCACAGATATC^LATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
Q ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
|Jj CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
JJf GCCT GGATT GGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACC TCAGT TATGAG 

y GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
ill AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 

ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T T CAC CCAC TGGACCAT GT CAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
G7UVGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
T CCCCT GGGAGAGAGC CCAGCACCCAAGAT CCAGCAGGGGACAGGACAGAG TAGAAGCCCC TCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 



\ 
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FIGURE q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVS I IESTWVSLEPIHLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGG.FSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRIjMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
E AAPSHE VWWQS VAKLVGPGPGPGATAT VT VL VERVMP P PKLDQE S YE AS VP I SAPAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTIiAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHI IPVWSHNAQMWQLLVRVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGI FLI 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGT CCAAGAT T CT TCCCAGGAACACAAACGTAGGAGACCCACGC T CCT GGAAGCACCAGCC T T TA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATCT TGCT TT CCAGCCACAAAGAGACAGATGAAGATGCAGAAAGGAAATG 
T TCTCCT TATGTTTGGTCTACTATT GCATT TAGAAGCTGCAACAAAT TCCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCACCAACTCTGAGTTCCATACAACCTCCAGTGGGATCAGCACAGCCACCAACTCTGAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAAC TCTGAGT CCAGCACACC C TCCAGTGGGGC CAGCACAGTCACCAAC T 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGT AGGGCCAGCAC T GCCACCAACT C T GAGTC TAGCACAC TCTCCAGT GGGGCCAGCACAGCCAC 
CAACT CT GACT CCAGCACAACC T CCAGT GGGGC T AGCACAGCCACCAACT C T GAG TCCAGCACAA 
CCT CCAGT GGGGCCAGCACAGCCACCAACT CTGAGTCCAGCACAGT GT CCAGTAGGGCCAGCACT 
GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGCCACCAAC TCTGAC TC CAGCACAGTGTCCAGTGGGGCCAGCACT GCCACCAAC TC T GAG 
T CCAGCACGACC TCCAGTGGGGCCAGCACAGCCACCAAC TC TGAGT C CAGCACGACC TC CAGT GG 
GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
AGTGGGGCCAACACAGCCACCAACT CTGAGTCCAGTACGACCTCCAGT GGGGCCAACACAGCCAC 
CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCACCAACT C TGAC TCCAGCACAACCT CCAGTGAGGCCAGCACAGC CACCAAC TCT GAGT CTAG 
CACAGTGT CCAG TGGGATCAGCACAGT CACCAAT TC TGAGTCCAGCACAACC TCCAGTGGGGCCA 
ACACAGCCACCAAC TC TGGGTCCAGT GT GACC TC TGCAGGC TC T GGAACAGCAGC TCTGAC TGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGTCCTAACTGGTTCTGGAGGAGACCAGTATCATCGATAGCCATGGAGATGAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCTTGAAGAAGGTATTCCTCACCTTTCTTGCCTTTACCAGACACTGGAAAGAGAATACTATAT 
T GCT CATT TAGCTAAGAAATAAATACATCTCATC TAACACACACGACAAAGAGAAGCTGTGCTTG 
C CCC GGGGT GGG TATC TAGCTC T GAGATGAAC T CAGT TAT AGGAGAAAACC T CCATGC TGGAC T C 
CAT C T GGCAT T CAAAATC TCCACAGTAAAATCCAAAGACC TCAAAAA?yy\AAAAAA7U^AAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE loo 

MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GAS TVTNS G S S VT S S GAS T ATNS ESS TVS S RAS TATNS E S S TL S SGAS TATNSDS S T T S S GAS T A 
TNS E S S T T S S GAS TATNS ESS T VS SRAS TATNSE S S T TS S GAS TATNSE S RT T SNGAGTATNS E S 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTS SGANTATNSGS S VTSAGSGTAALTGMHTTSHSAS TAVSEAKPGGSLVPWE I FL I TLVS WA 
AVGLFAGLFFCVRNSLSLRNT FNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTMlCGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCC'CAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AAC GAGGCAGT T TC TAGC TAC T GCACGTACTTCATAAAGCAGGACT C T AAAAGC T T TGGAATCAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT TATAACT GGGGAT GCATT TGTT CC T GGAGAAAGAAGT GTCAT TATCAT GAACCATCGGACAA 
GAAT GGAC TGGATGT TCC T GTGGAAT T GCCTGATGC GATATAGCTAC C T CAGAT TGGAGAAAAT T 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
CTTCATTCATAGGAAATGGAAGGATGACAAGAGCCAT TTCGAAGACATGAT TGAT TACTTTTGT G 
AT AT T C AC G AACC AC T T CAAC T C C T CAT AT T C CC AG AAGGG AC T GAT C T C AC AGAAAAC AGC AAG 
T CTCGAAGTAAT GCAT TTGC T GAAAAAAATGGACT TCAGAAATAT GAATAT G TT T TACATCCAAG 
Q AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
If! AT AT CAC T GT GGCGTATCCTCACAACAT T CCTCAATCAGAGAAGCACCTCC T CCAAGGAGACT T T 



i. J 



CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
CCTT CAACT CT GGTGCCACAAACGGT GGGAAGAGAAAGAAGAGAGGC T GCGT TC C TT C TAT CAAG 
GGGAGAAGAATTTTTATTTTACCGGACAGAGTGTCATTCCACCTTGCAAGTCTGAACTCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAATAT T T G G T GGAC T GGAGAT C ATAGAAC T T G CAT G T TAC C GAC T T T T ACAC AAACAGC CACAT 
TTAAATTCAAAGAAAAATGAGTAAGATTATAAGGTTTGCCATGTGAAAACCTAGAGCATATTTTG 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCATCAT TAT T TGT TAAAGAT AT T T TGCACT TAAT TT TGTGGGAAAAATAT TGC TACAAT T TT T 
T T T AAT C T C T GAAT G T AAT T T C GAT AC T GT G T AC AT AGC AGGGAG T GAT C G G GG T GAAAT AAC T T 
GGGCCAGAATAT TAT TAAACAAT CAT CAGGCT T T TAAA 
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FIGURE 102 

MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVI PPCKSELRVLWKLLS I LYWTLFS PAMCLLI YL YSLVKWYFI I T I VI FVLQERI FGG 
U: LEI IELACYRLLHKQPHLNSKKNE 

o 

2*f Important features of the protein: 

I'.jj Signal peptide: 

|H amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 103 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
T CCAAAT CATCCAT CCACCC CT GCT GTCAT CT GT T T T CATAGT GT GAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCT CTACAGAGATGGGGAAGACT GGGAATC TAAGCAGATGC CACAGTAT CGAGGGAGAAC TGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
Li: AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
AT TATAG T C CAGGAAAAT GC TGGGAGCAT AT T GT GTT CCAT CCACCT T GCTGAGCAGAGTCAT GA 
JlJ GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTGACCTTGGCGCCTGGCTTCTA 
y TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
%fl AAATCCAAAGGGAAAATCCAGGCGGAACT GGACTGGAGAAGAAAGCACGGACAGGCAGAAT TGAG 

||| AGAC GCCC GGAAACACGCAG TGGAGGT GACT CT GGATCCAGAGACGGC TCACCCGAAGC TC TGCG 

If! T T T C TGAT C T GAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCT CACT CTGAGAAGAGA 

%l TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 

CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
4* AT GT GAC T T TGTC TCCCAACAAT GGG TAT T GGGTCC T CAGACT GACAACAGAACAT TT GTAT T TC 

3 ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
Q GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
?Sji T GACAT GT CAGT T TGAAGGC TT GTT GAGACCCTATAT CCAGCAT GCGATGTATGACGAGGAAAAG 

JJ-j GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
U CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
ty CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
Q TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 

TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA 
2%l CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 

ACAG TGAT T CC T GCC TCACAGGT GAAGAT TAAAGAGACAACGAATGT GAATCATGC TTGCAGGTT 
TGAGGGCACAGTGTT T GCTAAT GAT GT GT T T T TATAT TATACAT T T T C CCACCATAAAC TC TGT T 
TGCT TATTCCACATTAATT TACT TT TCTCTATACCAAATCACCCATGGAATAGT TATTGAACACC 
T GCT T T GT GAGGC T CAAAGAATAAAGAGGAGGTAGGAT T T T TCACTGAT T C T AT AAGC CCAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
C T CAT T AACACAGACAC AAAAAT T C T AAAT AAAAT T T TAACAAAT TAAAC T AAACAATATAT T TA 
AAGATGAT ATATAAC TACT CAGT GT GG TT T GTCCCACAAATGCAGAGT T GGT TTAATAT TTAAAT 
AT CAACCAGT GTAAT T CAGCACATTAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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FIGURE 104 

MAFVL I LVLS FYELVSGQWQVTGPGKFVQALVGEDAVFSCSL FPE TSAEAMEVRFFRNQFHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGWDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
I VQENAGS I LC S IHLAEQS HE VE SKVL I GET FFQP S PWRLAS I LLGLLCGALCGWMGMI I VFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWAS QG FQAGRHYWEVDVGQ WGWYVGVCRDDVDRGKNNVTLS PNNGYWVLRLTTEHL YFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE los 



CC TT CACAGG AC TCTT CATT GCT G GT TGGC AATGAT GT ATCGGC CAGATGTGGT GAGGGCT AGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACT GT TC ATT AT GTGAGAT AT AATCAAAAGAAGAC CT ACAATT ACTATAGC AC AT TGTC AT TT AC AAC 
TGACAAACTATATGCTGAGTTTGGCAGAGAGGCTTCTAACAATTTTACAGAAATGAGCCAGAGACTTGAAT 
C AAT GGTG AAAAAT GC AT TT TATAAATCT CCAT TAAGGGAAG AATT TGTC AAGT CTC AGGTT AT CAAGTT C 
AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 
AACT GT AG AT AAAATT GT T C AACT TGT TT T AC ATGAAAAGCT GC AAGATGCTGT AGGACC CCCT AAAGT AG 
L . ATCCT C AC T C AGTT AAAAT T AAAAAAAT CAAC AAGACAGAAAC AGACAGCTAT C T AAACC AT T GCT GC GGA 

f=* ACACGAAGAAGT AAAACT CT AGGT C AGAGT CT CAGGAT CGTTGGT GGG AC AGAAGT AGAAGAGGGT GAAT G 

|3 GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTA^TGCCACATGGC 
|fl TTGTGAGT GCTGCT CACT GT TT TACAAC AT AT AAGAACCCTGCC AGAT GG ACT GCT T CCT TT GGAGT AAC A 

|y ATAAAACCT T CGAAAATGAAACGG GGTCT C CGGAGAAT AATT GT CCAT GAAAAATAC AAAC ACCC AT C AC A 

II jj T G ACT ATG AT AT T TCT CT T GCAGAGCTT T C TAGC CCTGTTCCCT AC AC AAATGC AGT ACATAGAGT T T GT C 

T C CCTGAT GC AT CCT ATGAGTT TC AACC AGGT GAT GT GAT GT TT GTGACAGG AT TT GG AGCACTGAAAAAT 
^ p GATGGTTACAGT CAAAAT CATCTT C G AC AAGC ACAGGT GACTCT CATAGACGCT ACAACTTGC AAT GAACC 

1. TC AAGC TT AC AAT G AC GCCAT AACT CCT AG AAT G T T AT GTGCTG GCTCCT TAGAAGGAAAAAC AGATGC AT 

*% GCCAGGGTGACT CTGGAG GACC ACT GGT T AGT TC AGAT GCTAGAGAT ATC TGGT ACCTT GCT GGAATAGT G 

p| AGCT GG GGAGAT GAAT GTGCGAAACC CAAC AAGC CT GGT GTT T ATACT AGAGT T ACGGCC TT GCGGGACT G 

| s | GATT AC TTCAAAAACT GGT ATC TAAGAGAC AAAAGC CT CAT G GAAC AG AT AAC ATT TTT T TT TGTT TT TTG 

CI GGTGTGGAGGCCATTTTTAGAGATACAGAATTGGAGAAGACTTGCAAAACAGCTAGATTTGACTGATCTCA 
f|| ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 

GATC AACTCT GT C ATCTGT GAGCAAT AGTT GAAACT TT ATGT AC AT AGAG AAAT AGAT AATACAAT AT T AC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TCAAGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 
AAGT AT T AGGTGTT TT TCT T AGTGGAAT AT TAGAAATGATC AT AT T CATT AT GAAAGGTC AAGCAAAGAC A 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
AT AT AT CCT T ATTTTC AT TTCC AAAC AACT ACT AT GAT AAAT GT GAAGAAGAT TCTGTTTTTTTGT GACCT 
ATAATAATTATACAAACTTCATGCAATGTACTTGTTCTAAGCAAATTAAAGCAAATATTTATTTAACATTG 
T T ACTGAGGATGTCAAC AT AT AACAAT AAAAT AT AAAT CACCCA 
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FIGURE 106 

MMYRPDVVRARKRVCWE PWVI GLVI FI SL I VLAVC IGLTVHYVRYNQKKT YNYYS TLS FTTDKLY 
AE FGREASNNFTEMS QRLESMYKNAFYKS PLREE FVKSQVI KFS QQKHGVLAHMLLI CRFHS TED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPYPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTL I DAT TCNE PQAYNDAI TPRMLCAGS LEGKTDACQGDSGGPLVS S DARDI WYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 107 



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACA ATG GCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGTTCTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGTCATTTTGACTTACTTTCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GAT AT GGT CACT TTGAAAGCCAGGAT GACAAAT TATGGAT TACCT AGAT AT CGG T GGCT TAC TCA 
T GCT T GGAAT TT TT T TCAGAGAGAG T T TAAGT GCT GT GGAGTAGTAT ATT TCAC TGAC T GGT TG G 
AAAT GACAGAGATGGACT GGCCCCCAGAT TCCTGCTGTGT TAGAGAAT TCCCAGGATGT TCCAAA 
CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 
T T T GAGAGGAAC C AAACAAC T G C AG G T GC T GAG G T T T C T GGGAAT C T C CAT T GGGG T GAC AC AAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAAT GATGTCCT TGAAGAAT GACAAC TCTCAGCACCT GTCATGT CCCTCAGTAGAAC T 
G T T GAAAC C AAG CC T G T C AAGAAT C T T T GAAC ACAC AT C CAT G G CAAACAG C T T T AAT AC ACAC T 
T TGAGAT GGAGGAGTTATAAAAAGAAATGTCACAGAAGAAAACCACAAACT TGTTTTATTGGACT 
T GT GAAT T T TT GAG TACATACT AT GT GT T T CAGAAATATGTAGAAATAAAAATG T TGCCATAAAA 
TAACACCTAAGCATATACTATT CTAT GCTT TAAAATGAGGATGGAAAAGTT TCATGTCATAAGTC 
ACCACCT GGACAATAAT T GATGCCCT TAAAATGC TGAAGACAGAT GT CATACCCAC T GTGTAGCC 
TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
T C CAT GCAAACGAGTCACATAT GGTGGGACTGGAGCCATAGTAAAGGT T GAT TTAC TTC TACCAA 
CTAG TATATAAAGTAC TAATTAAAT GCT AACATAGGAAGT TAGAAAAT AC TAATAACT T TTATTA 
CT CAGCGATCTAT TC T TCT GAT GCTAAAT AAAT TATATATCAGAAAACTT TCAATATT GGTGACT 
ACCTAAATGTGATTTTTGCTGGTTACTAAAATATTCTTACCACTTAAAAGAGCAAGCTAACACAT 
TGT C T TAAGCTGAT CAGGGATT T TT T GTATATAAGTC TGTGTT AAAT CTGTATAAT TCAGT C GAT 
T T CAG T T C T GAT AAT GT TAAGAATAAC CAT TAT GAAAAGGAAAAT TTGTCCTG T AT AGCAT C AT T 
AT T T T TAGCCT T TCC TGT TAATAAAGC T T TAC TAT TC TGT CC T GGGC T TAT AT TACACATATAAC 
TGTTATT TAAATACT TAACCACTAATTTTGAAAATTACCAGTGT GATACATAGGAATCATTAT TC 
AGAATGTAGTC TGGTCTT TAGGAAGTATTAATAAGAAAAT TTGCACATAACTTAGTTGATTCAGA 
AAGGAC T TGTATGC TGTTTTTCT CCCAAAT GAAGAC TC TT TT TGACACT AAACACT T T T TAAAAA 
GCT TATCT T TGCC T TC TCCAAACAAGAAGCAATAGT CT CCAAGT CAATATAAATT C TACAGAAAA 
TAGTGTTCT TT T TCTCCAGAAAAATGCTTGTGAGAATCAT TAAAACATGTGACAATTTAGAGATT 
CTTTGTT TTATT TCACTGAT TAATATACTGTGGCAAAT TACACAGATTATTAAAT TTTT TTACAA 
GAGTATAGTATATTTATTTGAAATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTAT 
T T CT CAGAATAT GGAAAGAAAAT T AAAAT GTGTCAATAAATAT T T TC TAGAGAGTAA 
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FIGURE 108 



MJ^EDSVKCLRCLLYALNLLFWMS I SVIAVSA 

VMIAVCCFLI IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVE LACGVWT YE QE LMVPVQWS DMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQM 
EDLSDLYQEGCGKKMYS FLRGTKQLQVLRFLG I S I GVTQI LAMI LT I TLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRI FEHTSMANS FNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE 1QQ 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGTCTTTCTC GT GAGAGCCT AGAGGCCTTAAAAAAAAAAGT GCTT GAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAAC ACTGACCACC AGTT GGATGAGTCT CAAGGTCCTCCT CT AT GTGACAACC AT GT GAAT GGGGAG 
T GGTACCACTT CACGGGCATGGCGGGAGATGCCAT GCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
C AG CT GC TC AGAT AC C AGC GAGT GCACATGC GCTC C AGGAACT GT GCT AGGCCCT GAC AGGCAGACAT GCTT TG 
ATGAAAATGAAT GT GAGCAAAACAACGGTGGCT GCAGTGAGATCTGT GTGAACCTCAAAAACTCCTACCGCT GT 
i s GAGT GT GGGGT T GGCCGTGT GCT AAGAAGT GAT GGCAAGACTT GT GAAGACGTTGAAGGAT GCCACAAT AAC AA 

2 5 s| TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 

CT GAGGAT AAC C AC ACTTGCCAAGT CC C T GT GT T GT GC AAAT C AAAT GCC AT T GAAGT GAACAT CC CC AGGGAG 
**■ CT GGT T G GT GG CCT G GAGC T CT T CC T GACC AAC AC CTCC T GC C GAGGAGT GT C C AAC GGC ACCC AT GT CAAC AT 

5 ' CCT CT T C T CTC T C AAGAC AT GT GGT AC AGT G GT C GAT GT GGT GAAT G AC AAGAT T GT GGC C AGC AACC T CGT GA 

f"% CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
III ACC T GC GAGTT TCCAC GCC T GT AC ACC ATT T CT GAAGGAT AC GT T CC C AACC T T C GAAAC T C CCC ACT GGAAAT 

CAT GAGC CGAAAT CAT GGGATCT TC CC ATT C ACT C T GGAGAT C T T C AAGGAC AAT GAGT T T GAAGAGC CTT ACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
Q TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
HI CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 

TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCC AGAAAGAC ACT C ACCC CAT T TCCC T CAT TT CTTTCC T AC ACT T AAAT AC C T C GT GT AT GGT GCAAT C AGAC 
C AC AAAATCAGAAGCT GGGT AT AAT AT TTC AAGTT ACAAACCCT AGAAAAAT T AAACAGTTACT GAAAT TAT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TTATAGGGAAT TT GGAAGT GTAT CAAT AAAACAGT AT AT AATTTT 



i „ S 
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FIGURE no 



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCI PENHCGTHAPVWLNGSHPLEGDGI VQRQACAS FNGNCCLWNTTVEVKACPGGY Y VYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNI PRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDWNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLY FGIEPWHV 
SGLESLVESCFATPTSKIDEVLBCYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 



f B | Important features of the protein: 

|f| Signal peptide : 

amino acids 1-16 



N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 



N-myristoylation sites. 

f|| amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 

522-528, 531-537 

Aspartic acid and asparagine hydroxy la ti on site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE ill 



GAGAGAGGCAGC AGCT TGCT C AGCGGACAAGGATGC TGGGCGTGAGGG AC CAAGGCCT GC CC T GCACT CGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCT GCCT TGGGGT GACAAT CT CAGCT C CAGGCT AC AG GGAGACCGGGAGGATC ACAGAGC CAGC ATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
ATGGAGACCT T C AGAAAGGT GGGGAT CCCC AT CATC AT AGC ACT ACTG AGCCT G GCGAGT AT C ATCAT TGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGCAGCT GT GT GACG GAGAGCT GGACT GT CC CT TGGGGG AGGACGAGGAGC ACT GT GTC AAGAGCTT C 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
13 AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
V GCT AC AGC AG AGCT GT GGAGAT TGGCCC AG ACC AGG ATCT GGAT GT TGT T GAAATC AC AGAAAACAGC CAG 

"fl GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
~ TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
T C AGCATCC AGT ACGACAAACAGC ACGT CT GTGGAGGGAGC ATCCT GG AC CCCC ACT GGGTCCTC ACGGC A 
J[* GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG 
% CTTCCCATCC CT GGCT GTGGCCAAG ATCAT CAT C ATTG AAT T CAAC CCCATGT ACCCC AAAGAC AATGAC A 

Q TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
if! GATGAG GAGCTC ACT C CAGCCACCCCAC TC TGGATC AT TGGAT GGGGCTTT ACG AAGC AGAATGGAGGGAA 

GATGTC TGACAT ACTGCT GC AGGCGTCAGT CC AGGT CATT GACAGC ACACGGT GC AATGC AG ACGATG CGT 
ACCAGGGGGAAGTC AC CGAGAAGAT GAT GT GT GC AGGC AT CCCGGAAGGGGGT GT G GACACC TGCC AGGGT 
GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
CTGCGGGGGC CC GAGC ACCCC AGGAGTATACACC AAGG TC TCAGCCTATCTCAACT GGAT CT ACAATGT CT 
GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGAT CCCC CAAAGT CAGACACAGAGC AAGAGT CC CCT T GGGT AC ACCCCT CT GCCC AC AGCCT CAGCAT 
T T CT TGGAGC AGC AAAGGGCCTCAATTCCT GT AAGAGACC CT CGCAGCCC AGAGGCGCCC AGAGGAAGTC A 
GCAG CCCT AGCT CGGC C ACACT TGGTGCTC CC AG CATCCC AGGG AGAGAC AC AGCCCACT GAAC AAGGTCT 
CAGG GGTAT T GCT AAGCC AAGAAGGAACTT TCCC AC ACT ACT GAAT GGAAGC AGGCT GTCT TGT AAAAGC C 
CAGAT C ACTGTGGGCT GG AGAGGAGAAGGAAAGGGT CTGCGCCAGC CCT GTCCGT CTTC ACCCATCCC CAA 
GCCT AC TAGAGC AAGAAACC AGTTGT AAT AT AAAAT GC AC TGCC CT ACTGTT GGTATGAC TACCGT T ACCT 
ACTGT T GT CATT GTT ATT AC AGCT AT GG CCACT ATT AT T AAAGAGC TGT GTAAC ATCT CT GGCAAAAAAAA 
AAAA 
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FIGURE 112 

MLQDPDSDQPLNSLDVKPLRKPRI PME T FRKVG I P 1 1 IALLS LAS 1 1 1 VWL IKVI LDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKS FPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDM 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVS IQYDKQHVCGGS I LDPHWVL TAAHC FRKH T DVFNWKVRAG S DKL 
GSFPSIiAVAKI 1 1 IEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWI IGWG 
FTKQNGGKMSDI LLQASVQVIDSTRCNADDAYQGEVTEKMMCAG I PEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE 113 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
T GAACTGCTAGGAT TC TGAC TATGC T G TGGT GGCTAG TGC TCCTACT CC TACC TACAT TAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCC CAAAC CTGACC T TCAC TC T GGAACGAGAACAGAGGT T TCTACCCACACCGTCCCC TCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAAC T T GT GCCAAAT TAT GGGTCAGAAAAGAT G 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
C T CAAT TTAAAT CATGT TC TAGTAAT TGGAGCTGTCCCCAAGACCAAAGGAGC TAGAGC T TGGTT 
CAAATGAT C T CCAAGGGCCC TT ATAC CCCAGGAGACT T TGAT TT GAAT T TGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTAC TCGGGAGGCTGAGACAGGAGAAT TAC TT GAACC TGGGAGGT GAAGGAGGC TGAGACA 
GGAGAATCACTTCAGCCTGAGCAACACAGCGAGACTCTGTCTCAGAAAAAATAAAAAAAGAATTA 
TGGTTATTTGTAA 
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FIGURE 114 



MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE iir 

CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACC ATGG CAA 
AGAAT CCT CC AGAGAAT TGT GAAGAC TGTCACAT TC TAAATGCAGAAGC TT T TAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
T CT ACAGCAATGGAGAGAAGAAGAAGAT TTACAT GGAAAT TGAT CCT GT GACCAGAAC T GAAATA 
T T CAGAAGCGGAAAT GGCAC TGAT GAAACAT TGGAAG TGCACGAC T T TAAAAACGGATACAC TGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
C T GAACCAGAAGAGGAAATAGATG AGAAT GAAGAAAT TAC CACAACT T T CT T TGAACAGTCAGT G 
ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
GAT T T GTGATAACGTGACCATGTATT GGAT CAAT CCCACTC TAATAT CAGT T TC TGAGT TACAAG 
ACTT T GAGGAGGAGGGAGAAGATCTT CACT T TCC T GCCAACGAAAAAAAAGGGATT GAACAAAAT 
G AAC AG TGGGTGGTCCCT CAAG T G AAAG T AG AG AAGAC C C G T C AC G C C AG AC AAG C AAG T G AGGA 
AGAACT TCCAATAAAT GAC TATACTGAAAAT GGAAT AGAAT T TGATCC CAT GC TGGATGAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTC TAA TAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
T T AT C AC C AAAAAAAAAAAAAAAAAAA 
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FIGURE 116 

MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
E FSE PEEE I DENEE I T TT FFEQSVI WVPAEKP IENRDFLKNSKI LE I CDNVTMYW INPTL I S VSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTC CCCT CAGGAGCGCGTT AGCT TC AC AC CT T CGGCAGC AGGAGGGC GGCAGCT TCT CG C AGGCGGC A 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CC AAAGCCAACAT G AC ACT GACCT C CGGGATC AT GT TCAT TGT C TC AGGT CT T T GT GC AAT T GC TGGAGT G 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
i =% GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
?*f. TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
n% GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
pj TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
Iff CTTCCAAGCACGACTATGTG TAAT GCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 
III CCCAAAAAAC AAGG AG AT CCCATCT AGATT TCT T CT T GCT TT TGAC TCAC AGCTGG AAGT T AGAAAAGCC T 

CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
3 GCTGAGTT AT TT AT GAAT TAGAGGCT AT AG CT CACAT TT T CAATCCTCT ATT TC TT T T TT T AAATAT AACT 

?!? TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 

in 

J«| TTCCTCCT AGTC AAT AAACCCATTGAT GAT CT AT TT CCCAGC TT AT CC CCAAGAAAAC T TTT GAAAGG AAA 

%?i GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 

:i,.L': 

f*% AC ACTT ACTGAAGAAG AAGCAATAAGAGAAAGAT AT T TGT AATCTCTCCAGCCC AT GAT C TC GGTT T T CTT 

||| ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 

TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTCAGAAAT TGTCCCTAGAT GAATGAGAAAAT TAT TT T TTT TAAT TT AAGT CCTAAAT AT AGT TAA 
AATAAATAAT GT TTT AGT AAAATGAT ACAC TATCTCT GT GAAAT AGCCTC ACCC CT AC AT GT GGAT AGAAG 
G AAATG AAAAAATAAT TGCTTT GAC ATT GT CT AT ATGGT ACTTTGT AAAGTC AT GCT T AAGT AC AAAT TC C 
AT GAAAAGCT CACACCTGT AAT CCT AGC ACTTTGGGAGGC TGAGGAGGAAGG AT CACT TGAGCCC AGAAGT 
TCGAGACTAGCCTGGGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCA 
T GGT GGCAT AC ACCTGTAGTCCCAGC AT TC CGGGAGGCT G AGGTGGGAGGAT CACT T GAG CCCAGGGAGGT 
T GGG GCTGCAGT GAGCCAT GAT CAC ACC AC TGCACT CC AG CC AGGT GACATAGC GAGATC CTGTCT AAAAA 
AAT AAAAAAT AAAT AATGGAACACAGCAAGTCCTAGGAAG TAGGTT AAAACT AATT CT TT AA 
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FIGURE 118 



MS TTT CQWAFLLS ILGLAGC I AAT GMDMWS TQDLYDNPVTS VFQYEGLWRS CVRQS S GFTECRP 
YFTILGLPT^LQAVRALMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL 
CAIAGVSVFAJMLVTNFWMSTANMYTGMGGMVQW 

CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTSDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 

GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CAT CAGGAT GCAGTGCAAAAT CTATGATT CCCT GCT GGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
P ACT CAATAGTGAAT GTT GCCCAAAAAC GT GAGCTTGGAGAAGCTCTCT ACTTAGGAT GGACCACGGCA 

O CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
fl CAGATACT CGAT ACCTT CCCATCGCACAACCCAAAAAAGT TATCACACCGGAAAGAAGTCACCGAGCG 

fi TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
>; AAT CT TAATTACAGGAACTGTGCATCAGCTATTTAT GAT T CTATAAGCTATTT CAGCAGAATGAGAT A 

s TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
f!f CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
Wl ACGACATAGCATTATGTACATAGATGAGT GTAACATT TATATC T CACATAGAGACAT GCT TAT AT GGT 

W TTT ATTTAAAATGAAAT GCCAGT CCATTACACT GAATAAATAGAACTCAACTATTGCTT T TCAGGGAA 

% AT CATGGATAGGGTT GAAGAAGGT TACTATTAATTGT TT AAAAACAGCT TAGGGAT T AAT GTCCTCCA 

lip TTT AT AAT GAAGATTAAAATGAAGGCT TTAAT CAGCATT GTAAAGGAAATT GAATGGCTTTCT GATAT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATAT TTTTGTTTTT GTAT TT GAAGAAGAATGAT GCAT T TTGACAAGAAAT CATATAT GTATGGAT 
ATAT T TTAATAAGTAT T TGAGTACAGACTTT GAGGT T TCAT CAATATAAATAAAAGAGCAGAAAAAT A 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT ACATT TATATTAATAAATTGTACATT TTTCTAATT 
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FIGURE 120 



mTHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCVRQANIRMQCK 
IYDSLLALSPDLQAARGLMCAASVMS FLAFMMAILGMKCTRCTGDNEKVKAHILLTAGIIFIITG 
MWL I PVS WVANAI I RDFYNS I VNVAQKRELGEAL YLGWTTALVL I VGGALFCCVFCCNEKS S S Y 
RYS I PSHRTTQKS YHTGKKS PSVYSRSQYV 

Signal peptide: 

amino acids 1-17 



Q 

|3 Transmembrane domains: 

111 amino acids 82-101, 118-145, 164-188 

W 

m 
is 

3 

o 
in 
o 
w 
o 
ni 
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FIGURE 121 

GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATT CAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGC T T T GAGGAG T CCT GGACACCCAAC TAC 
AAGCAGTGT T CATGGAGT TCAT T GAAT TAT GGCATAGATCT TGGGAAAATT GCGGAGT GTACAT T 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGGTCACTTCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
AT TGAAGC TATAATT TATTTGGACCAAGGAAGCCCTGAAATGAATTCAACAAT TAATAT TCATCG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
T T GGCACT TGT TCAGAT TACCCAAAAGGAGATGCT T C TACTGGAT GGAAT T CAGT TTC TCGCAT C 
AT TAT TGAAGAAC TACCAAA ATAAA T GCT T T AAT T T T CATT TGCT ACC TCT T T T T TTAT TATGCC 
TTGGAATGGTTCACTTAAATGACATTTTAAATAAGTTTATGTATACATCTGAATGAAAAGCAAAG 
CTAAATAT GT T TACAGACCAAAG TGT GAT T T CACAC T GT T T TTAAAT C T AGCAT TAT T CAT T TTG 
CT TCAAT CAAAAGT GGT TT CAATAT T T TT T T TAGTT GGT TAGAATAC T T TC T TCATAGT CACAT T 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGC TAC GAAT C T T T GTACAA.T T TGTAAATGT TAAGAATT T T T T T TATATC TGT 
T AAAT AAAAAT TAT TTC C AAC A 
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FIGURE 122 

MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT TG C C T GG AAGAAT ACAT CAT GT T T T T C GATAAGAAGAAAT T G TAG GAT C C AG T T T T T T T T T T A 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCC TAAGCCAAAGCAAAAGACC TAAGGACGACC T T T GAACAATACAAAGGATGGGT T TCAAT G 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
H ATCTCAGA7VATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
rl ATAACAGCC TT CAAAAAC TTAAGTAT AAT CAAT T TAAAGGGCTCAACCAGC TCACC TGGC TAT AC 

fi C T TGAC CAT AACCATATCAGCAAT AT TGACGAAAATGC T TT TAAT GGAATACGCAGAC TCAAAGA 

GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
y\ TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
I'"! CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
|f| AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
f$\ T CTT TGCTGGCATGATCAGACT CAAAGAACT TCACCTGGAGCACAATCAATT TTCCAAGCTCAAC 

CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
*** CATAGGACAGACCATGT CC T GGACCT GGAGCTCC T TACAAAGGC T TGAT T TATCAGGCAATGAGA 

* TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
O TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 
||1 TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
|y TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCT^AGGAGTAAATGTG 

AT CGAT GCAGT GAAGAACTACAGCAT CT GTGGCAAAAGTAC TACAGAGAGGT T TGATCT GGCCAG 
Ul GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
Q TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
ril TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 

CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGT CCCT AAAGCAAATGAC TCCCAGCACCCAGGAAT T T TATG TA 
GAT TATAAACCCACCAACACGGAGACCAGCGAGATGC TGC TGAAT GGGACGGGACCC TGCACC TA 
T AACAAATCGGGC TCCAGGGAGT G TGAGGTATGAACCATT GTGATAAAAAGAGCT CT TAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCAT T T T CCTC TCATACATAAT CAACCCAT TGAAAT T TAAATACCACAAT CAATGTGAAGC TT 
GAACT CCGGT TTAATATAATACCT AT TGTATAAGACCCT T TACT GAT T C CAT TAAT GT CGCAT T T 
GT T T TAAGAT AAAACT T CTT TCATAGG T AAAAAAAAAAA 



AppJD=l 0063584 



Page 273 of 318 



FIGURE 124 

MGFWIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT I ICAS PKELQGVNVI DAVKNYS I CGKST TERFDLARALPKPT FKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 

CCGTTATCGTCTTGCGCTACTGCTGA ATG TCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT T GAAGAGGAAGGC T T TC TAAAGC T T TGGCAAGGAGTGACAC CCGCCAT T TACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
TTAGCCAAT CCAACTGACC TAGTGAAGGTTCAGATGCAAATGGAAGGAAAAAGGAAACTGGAAGG 
AAAACCAT T GCGAT T TCGT GGTGTACATCATGCAT T T GCAAAAAT CT T AGCT GAAGGAGGAATAC 
GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
ACCACT TAT GAT AGAGTGAAACAC TACT T GGTAT TGAATACACCACT T GAGGACAATATCAT GAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
T CAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACT TT TGTATAAATCATCG 
ACTGACTGC TT GATT CAGGC TGT TCAAGGT GAAGGAT TCAT GAGT CTAT AT AAAGGC T T T T TACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGT GGAG TCAGTC CATT T TAA 
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FIGURE 126 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHVVYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VIGGMMAGVIGQFIANPTDLVKVQMQM 

NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 



m 



2 
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FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
~f AG G AGC GGAAT G T GGAC GAT GG G C ACAT CAT T AACAT C AAT AG CAT G T CT G GCC AC CGAGT G T T A 

ff| CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
||| GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
111 AGACACAAT TCGCC T T CAAACT CC AC GACAAGGACCC T GAGAAGGCAGCT GCCACCTATGAGCAA 

m 

ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGATTGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGTGACTGTGGGAGCTCC 
Q TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
111! CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
O TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
fl TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 

St TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 

111 

TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCC TGGCCCAGTGGATT TCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 



MARPGMERWRDRLALVTGASGGIGAAVARALVQQGLKWGCARTVGNIEEIAAECKSAGYPGTLI 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPD 

EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 



111 



AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
T T AAT T T T TAAC TGATAGT TGTACATAT T T GGGGGTACAT G TGATAT T TGGATACATGTATACAA 
TATATAAT GATCAAAT CAGGGTAAC T GGGATATCCAT CACATCAAACAT T TAT T T TT TATT CT T T 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
TTCCATCCATGT TGC TGCAAAT GACAGGATT TCGT TC TTAATT T CAAT TAAAATAACCACACATG 
GCAAAAA 
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FIGURE ISO 

MGLLLLVLFLSLLPVAYT IMSLPPS FDCGPFRCRVSVAREHLPSRGSLLRGPRPRI PVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE 131 



TTCTGAAGTAACGGAAGCTACCTTGTATAAAGACCTCAACACTGCTGACC ATGA TCAGCGCAGCCTGGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
G AAT AC CAGAGGAT GC T ACAACTCTC TACC TT C AGAAC AACC AAAT AAAT AAT G CTGGG ATT CCT T CAGAT 
T TGAAAAACT TGCTGAAAGT AGAAAG AAT ATACCTATACC AC AACAGT TT AGAT GAATT T CCT ACC AACC T 
CCCAAAGT AT GT AAAAGAGT TACATT TGCAAG AAAATAAC AT AAGGACT ATC AC TT ATGATT CACT T TCAA 
. AAAT TCCCT ATC TGGAAG AATT AC AT T T AGATGAC AACT CTGTCT CTGCAGTT AGC AT AG AAGAGGGAGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
^ GCCC AGGACT AT AGAAGAACT ACGCT TGG ATGAT AATC GC AT AT CC ACT ATTTC AT C ACC AT CTCT T C AAG 

J* GTCT CACT AGT C TAAAACGC CT GGT T CT AG AT GGAAAC CT GT TGAAC AAT CATGGT T T AGGT GACAAAGT T 

m TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 

IP 

§ \ 3 TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 

||| CTT ATC TAAGGC AGCT CTATCG ACT GGAT AT GTCCAAT AAT AAC CT AAGT AATT TACCT C AGGGT ATCTT T 

in GAT G AT TTGGAC AAT AT AACAC AACT GAT T CT T CGC AACAAT CC CT GGTAT TGC GGGTGC AAGATG AAAT G 

4? GGTACGTGACTGGTT AC AATCACT AC CT GT GAAGGT CAACGT GC GTGGGCTC AT GT GCCAAGCCCCAGAAA 

* AGGT TC GTGGGATGGCT AT T AAGG ATCT CAAT GC AGAACTGT T T GATT GT AAGG ACAGT G GGATT GTAAGC 

i"l ACC ATT CAGATAACC ACT GC AAT ACCC AAC AC AGT GTATCCT GCC C AAGG AC AGTG GCC AGC T CCAGTGAC 

I p 

CAAACAGCC AGATAT T AAGAAC CCC AAG CT CACT AAGGAT C AAC AAACCAC AGGGAGTCC CT CAAG AAAAA 
CAAT T AC AAT T ACTGT GAAGT C TGTC AC CT CT GAT ACC AT TC AT AT CT CTTGGAAAC T TGCT CT ACCT AT G 
JJJ ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 
fss AACAGGGG AACGCAGT GAGT ACTT GGTC AC AGCC CT GG AGCCTG ATTC AC CCT ATAAAGT AT GCAT GGTT C 

CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 
CGAATGTACAACCCTACAACCACCCTCAATCGAGAGCAAGAGAAAGAACCTTACAAAAACCCCAATTTACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TTCATAGGAATGGATCGCTCTTCTCAAGGAACTGTGCATATAGCAAAGGGAGGAGAAGAAAGGATGACTAT 
GCAGAAGCTGGC ACT AAGAAGGAC AACT CT AT CCTGGAAATC AGGGAAAC TT CT TT T CAGAT GTT ACC AAT 
AAGC AATGAACC C AT CTC GAAGGAGGAGTT TGT AAT AC AC AC C ATATT TC C T CCT AAT GGAATGAATCTGT 
AC AAAAACAATC AC AGTG AAAGC AGT AGT AACCGAAGCT ACAGAGACAGT GGTATT CC AG ACT C AG AT CAC 
TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNS VS AVS I EEGAFRDSNYLRLLFLSRNHLS T I PWGLPRT I EELRLDDNRI STISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAI KDLNAE L FDCKDS G I VS T I Q I TT AI PNTVYPAQGQW PAP VTKQPD 
I KNPKLTKDQQT TGS P SRKT I T I TVKSVT SDT I H I SWKLAL PMTALRLS WLKLGHS PAFGS I TE T 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAI IGGAVALVTIALLALVCWYVHRNGS LFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 
amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 



m 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
C TCCACTACCT CAAGC TGAGTGAT CCCAAGTACC TAAGAGAGT T CCAGCT GACCCTCCAGCCCGG 



$F? GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 



CCCAGGAC T CAT TCT CAGAGGAGAGAAGT GACGTGTGCCTGGT GCAGC T GCT GGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCACTCCAACAGAGCCAGGACTATATCAACCTCTTCTGCGCCAACATGATGGACTTGAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCT TCGGGGAGCCTGATGC TGAAGAT GAAGAAT T AT CTAAAGC TAT T CAA 
TATCAGCAGCAT T T T TCGAGGAGAGT GAAGAGGC GAGAAAAACAATT TCCAGAT TCTCGCT CT GT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCGTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
T T T TAT AT T T T T T TAGTAGAGACAGGGTT T CATCATGT TGC TCAT GCT GGT CTCGAAC T CCT GAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 124 

MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGTOVLEEQLKSWEKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLRE FQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDS FSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites . 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE ias 

GGTCTGAGTGCAGAGCTGCTGTC ATG GCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGT^AGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCT TACAGAT T TGATCCCGTT CGAGT GGAT AT CAC T TCGAAAGGAAAAAT GAGAGCAAGATAT G 
TGAAT T ACAT CAAAACAT CAGAGGT TG TCAGAC T GCCCTAT CC TCTC CAAAT GAAAT CT TCAGGT 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
C TGACAT GAGACGGGAAAT GGAGCAGT CAATGAATATGC TGAAT TCCAACCATGAGTT GCC TGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
GAGT CAT CCCGACG T T GATC TCT T ACAACT GTG T AT GT T 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGTCAAT AT T GATGTCAC TGAAT TAAT TACAGTGT C C TATAGAAAAT GCCAT TAATAAAT TAT 
AT GAACTACTATACAT TAT GTAT AT TAAT TAAAACATCT TAAT CCAGAAATCAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
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FIGURE 136 

MAAALWG FFPVLLLLLLSGDVQS SEVPGAAAEGSGGS GVGI GDRFKIEGRAWPGVKPQDW I SAA 
RVLVDGEEHVGFLKTDGSFWHDIPSGSYWEWSPAYRFDPVRVDITSKGKMRARYVNYIKTSE 
VWLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMM]^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 



Important features of the protein: 
Signal sequence : 

amino acids 1-23 



ff| Transmembrane domain: 

IM amino acids 161-182 

m 

■ P y. N-glycosylation site. 

amino acids 184-187 



Glycosaminoglycan attachment sites* 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 



Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 127 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CT TACCTGCT GGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACC TGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
If AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 
% T GCAAGCCAAAGATGGAAACAT TGACAT CAGAATC T TAAGGAGGACT GAGT CT T TGCAAGACACA 

AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
||| AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
111 CCATCAAGAAGGACC TCCGGCTCTCTCAT GCCCACATGACAT GCCAT TGT GGGGAGGAAGCAAT G 

% Sj| AAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGC 
TT T GGGGGAAC TAGACAT TCT T CT GCAAT GGAT GGAGGAGACAGAATAGGAGGAAAGT GATGC T G 
f*% CTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA 
Ifl ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
Q CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 

TGTAATATCTTTCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTA 

lH ATGTATTTAT TTTT TTACTTGGACAT GAAACTT TAAAAAAATTCACAGATTATAT TTATAACCTG 

ill 

ACTAGAGCAGGT GATGTAT TT T TAT AC AGTAAAAAAAAAAAACCT T GTAAAT T C TAGAAGAG TGG 
CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
ATCTACCT T ACAAT TACTGACCAT CC CCAGTAGACT CCCCAGT CCCATAAT T GTGTAT C TTCCAG 
CCAGGAAT CCTACACGGCCAGCATGT AT T T CT ACAAATAAAGT T TTC T TTGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SIANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQWMEET 
E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACC T CT CT TCAGTAAAGT T GT T AT T GTT CTGATAGAT GCC T T GAGAGATGAT TTTGTGTTTG 
GGT CAAAGGGT G TGAAAT T TAT GCCC T ACACAACT TACC T TGT GGAAAAAGGAGCAT C TCACAGT 
T TT GTGGC TGAAGCAAAGCCACC TACAGT TAC TAT GCCT CGAAT CAAGGCAT TGAT GACGGGGAG 
CCTTCCTGGCTTTGTGGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGTCTTTTATGGAGATGAAACCTGGGTTAAATTA 
T T CCCAAAGCATT T T GTGGAAT AT GATGGAACAACC TCAT TT T T CGT GT CAGAT TACACAGAGGT 
^ GGAT AAT AATGTCACGAGGCAT T T GGAT AAAGTAT TAAAAAGAGGAGAT TG GGACATAT TAATCC 

$Z TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
\t AAGCT GAGCGAGATGGACAGCGTGC T GAT GAAGAT CCACACCT CACT GCAGTCGAAGGAGAGAGA 

Q GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
SI GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
Cj CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
JZ ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
CAAT GAGAGAGCAGT TGAGATT T T TACAT T TGAATACAGT GCAGCT T AGTAAAC T GT TGCAAGAG 
40 AATGTGCCGTCATATGAAAAAGATCCTGGGTT TGAGCAGT T TAAAAT GTCAGAAAGAT TGCATGG 

£ GAACT GGATCAGAC T GTAC TT GGAGGAAAAGCAT T CAGAAGTC CTAT TCAACCT GGGC TCCAAGG 

TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
r ^ TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
hf CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
Pi CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
13 GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
||| AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
IS TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
Jfj! CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
i U TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 

CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
AC T CAGAT CCACAGAGCCCAGGATCAAGGGACCCAG T GCAGTGGCAGCAGGACT GT TGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
C T GAC C GAGAC AC T C AC AGC T T T GT C AT CAG GGC AC AGGC T T C C T CG GAG C CAGGAT GAT C T GT G 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C T GCACACAGTATGTAGT TACCAAAAGAAT AAACGGCAATAAT T GAGAAAAAAAA 
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FIGURE 14Q 

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-76 



N-glycosylation sites. 

f*j amino acids 56-60, 194-198 

i f% 

Q N-myristoylation sites. 

W amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 

jrf! 275-281, 278-284 

i II 

Amidation site. 

amino acids 154-158 



Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 

GGCACGAGGCAAGCC TT CCAGGT TAT CGTGACGCACCT TGAAAGTC TGAGAGC TACT GCCC TACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCC TATAGAAAACAAC TGCCAGCACCT T AAGACCACTCACACCT T CAGAG TGAAGAACT TAAAC 
CCGAAGAAAT TCAGCAT T CATGACCAGGATCACAAAGTAC TGGTCCT GGAC TC TGGGAAT CTCAT 
AGCAGT T CCAGATAAAAAC TACATACGCCCAGAGATC T T C T T T GC AT TAGCCTCATCC TT GAGC T 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGT GACAAGGATAAAGGACAAAGTCATCCATCCCTTCAGC TGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GT T GGGGT GACAGATAAAT T TGAGAACAGGAAACACATT GAAT T T T CAT T T CAACCAGT T T GCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
AT T T G AAC T AAT T G T AT AAAAAC AC CAAAC C T GC T C AC T 
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FIGURE 142 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 



o 



CTAGAGAGTATAGGGCAGAAGGAT GGCAGAT GAGT GACTCCACAT CCAGAGCT GCCT CCC T T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
C AG T G GAG TTTGCTGTC C ACACAT T CAAC CAAC AGAG C AAGGAC T AC TAT G C C TAC AGAC T GGG G 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
GGGGAGAACTAGGTGTGGGAAAT TT GAAGACGACATTGACAACTGCCATT TCCAAGAAAGCACAG 
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACAT CT TGAGCC TAAT CATGT AGT GTAGAT CAT TAAACATCAGCAT T T TAAGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE iaa 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMZ\RYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 



Important features of the protein: 
Signal peptide: 

? . _. amino acids 1-25 

b 

0 N-glycosylation sites. 

0 1 amino acids 117-121, 139-143 

i,y 

If! 

to 

a 
m 

o 
III 
o 
III 



N-myristoylation site. 

amino acids 9-15 
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FIGURE 145 

CTGTGCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCGATGA.GGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGAGATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACAT CAGGCAC TGCGCCACC T GCT TCACAGTACT T CCCAACAAC TC T TAGAGGTAG 
GTGTATTCCCGTTTTACAGATAAGGAAACTGAGGCCCAGAGAGCTGAAGTACTGCACCCAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GT CCAATT CCAGAGC T GT C TGGTGAT CAC T T TATGTC TCACAGGGACCCACATCCAAACAT GTAT 
CT CT AAT GAAAT TGTGAAAGCT CCAT G TT TAGAAATAAATGAAAACACC TGA 
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FIGURE Id6 



MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQE LCS LKHCE FWLERGAGLRVTMHQPVLLCLLAL I WLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAAC T TT CAGATTCAGGGGGTACATGT GAAGGT T T GT T T TATGAGTATAT T GCATGAT GCT GAGG 
TTTGGGGT 
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FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of* the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE 14Q 



GTCTCCGC GTCACAG GAACT T CAG CAC C CAC AG GGC G GACAGC G C T C C C CT C T AC C T GGAGAC TT GAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGA ATG GGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CTCTACAT AATAGAGAATTAAGTGCAGAZ^AGACCT TT GAATGAACAGATTGCT GAA 
GCAGAAGAAGACAAGAT TAAAAAAACATATCCT CCAGAAAACAAGCCAGGT CAGAGCAACTAT TCTT T 
%-i TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA 
Q T AAGAAG C T CC C CAC T T GAT AAT AAGT T GAAT G T G G AAG AT GT T GAT T C AAC C AAGAAT C G AAAACT G 

Q ATCGATGATTATGACTCTACTAAGAGTGGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCA 

rf! 

T TCAACTAGACGGGACTCCTT TAACCGCT GAAGACAT TGT CCATAAAAT CGCT GCCAGGAT TTATGAAG 

f~i AAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC 

C AAG CACATAC AC TG GAAGAT GAAGTAG CAGAG GT T T TAC AAAAAT T AAT C T CAAAG GAAGCC AACAA 
J* TTATGAGGAGGATCCCAATAAGCCCACAAGCTGGACTGAGAATCAGGCTGGAAAAATACCAGAGAAAG 
3 T GAC T CC AATGG CAGC AATT C AAGAT GGTCTTGC T AAGG GAGAAAAC GATGAAAC AG TAT C TAAC AC A 

O T TAAC CT T GAC AAAT GG C T T GGAAAGGAGAACTAAAACC TACAGT GAAGAC AAC T T T GAG GAACT CCA 

ji ATAT TTCCCAAATTTCTATGCGCTACTGAAAAGTAT TGATT CAGAAAAA.GAAGCAAAAGAGAAAGAAA 

C ACT GAT T ACTAT CAT GAAAACAC T GAT T GACT T T GT GAAGAT GAT GGT GAAATAT GGAACAAT AT C T 
12 C CAGAAGAAGGT GT T T C CT AC CT T GAAAAC T T GGAT GAAAT GAT T GC T C T T C AGAC CAAAAACAAG C T 

■pj AGAAAA?VAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 

CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAA 
GAT GATAACT C C AAC CC AGGAGGAAAGACAGAT GAAC CCAAAGGAAAAACAGAAGC CTAT T T GGAAG C 
CAT CAGAAAAAAT AT TGAAT GGTT GAAGAAACATGACAAAAAGGGAAAT AAAGAAGATT AT GACCT T T 
CAAAGATGAGAGACT TCATCAATAAACAAGCT GATGC TTAT GT GGAGAAAGGCATC CTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCAT T TATAGCAGCCTGTAAAAAT GGCAAAAGAT CCAGGAGTCT TT CAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GT T AT TAGAAAGT G C T GAAT T TACAGT AGT T AACC T T T TACAAG T GGT T AAAACAT AGC T TTCTTCCC 
G TAAAAAC TAT C T GAAAGT AAAG T T GT AT GTAAGC T GAAAAAAAAAAAAAAAAAAAA 
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FIGURE ISO 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAYFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEA^YEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
h* EEAEAIKRIYSSL 

iSi N-glycosylation sites: 

III amino acids 68-71, 346-349, 350-353 

---.li Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

Ih 220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 

341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 



Fil 
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FIGURE lsi 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
AT AATAACCAGC T TCTAGC T GGAGGGC T GCATGCAGGGAAGGTCAT T AAAGG TGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
y„, TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
ft% CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
J? CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
*=f T AAT GG T AACT GAC AAG T G T T AC C C T GAG C C CC GCAGGCCAACC C AT C C CCAG T T GAGC C T TATA 

yt GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
ly GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
|f| TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
Xl GAAAT GGC T CGAGCTCAGAAGATAAAAGAT AAGTAGGGTAT GCT GAT CC TC T T TTAAAAACCCAA 

GATACAAT CAAAATCCCAGATGC T GGT CTC TAT T CCCATGAAAAAGT GC TCAT GACATAT T GAGA 
4* AGACCTAC T TACAAAGTGGCAT ATAT TGCAAT T TAT T T TAATT AAAAGATACC TAT T TATATATT 

s TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
1=1 AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
{fl ATTCC TTGT ATAAAAATAAGAAAAGAAAT T AATC T TGAGGTAAGCAGAGCAGACAT CAT CTC T GA 

TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
O T TGTAGT AGTGATCAGGAAACAGATC TCAGCAAAGCCAC TGAGGAGGAGGC T G TGC TGAGT T TGT 

Id! G T GGC TGGAATC T CTGGGTAAGGAAC T T AAAGAACAAAAATCAT C TGGTAAT TC T T TCC TAGAAG 

p.| GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 

ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
? ^ GT CTC TGCAGAT GTAGT TAGTT AAGACAAGG TCATGC TGGATGAAGG TAGACC TAAAT TC AATAT 

GACTGG T T T CCT T GTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGAC TAT GTA 
AAGAT GAAGGCAGAGATCGGAGT T TT GCAGCCACAAGC TAAGAAACACCAAGGAT T GT GGCAACC 
AT CAGAAGC T T GGAAGAGGCAAAGAAGAAT TC TTCCC TAGAGGC T T TAGAGGGATAACGGCTC T G 
C T GAAACC T TAAT C T CAGAC TT CCAGCCTCCTGAACGAAGAAAGAATAAAT T TCGGCT GT TT TAA 
GCCACCAAGGATAATTGGT TACAGCAGCT C TAGGAAAC TAATACAGCT GCTAAAATGAT C CC TGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAJVAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGATGGCATGCCACTTCCAAGATTAGGT TATAAAAGACAC TGCAGCTTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCT AGC T GCCATGC TATGAGCAGGCCTATAAAGAGACT TACGT GGTAAAAAAT GAAGTCT CCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCTGTGAGATAATCGATGTTTGTTGTTTT 
AAGTTGCTCAGTT TTGGTCTAACT TGTTATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 152 

MVLSGALC FRMKDSALKVL YLHNNQLLAGGLHAGKVI KGEE I S WPNRWLDAS LS PVI LGVQGGS 
QCLSCGVGQE PTLTLE PVNIME L YLGAKESKS FT FYRRDMGLT S S FE SAAYPGWFLC T VPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

H Interleukin-1 signature. 

y amino acids 111-131 

o 

hi Interleukin-1 proteins. 

Ill amino acids 8-29, 83-120, 95-134, 64-103 

asp 

m 

o 

•Ep.;;' 

w 
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FIGURE 153 



CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC 
T TGGAGAGAGT GGAGAGATCAAAGCAATTGGAGAAC TGGATT T GCTGT T T AT GT C TCTGAGAAAT 
GCCTGCAT T TGA CCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCT TTCCCTGC TAGAAATAA 
CAAT TAGATGCCCCAAAGCGATT TT TT TTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGG TAGAC TT T C TAAGCATAGAT AT T TAT TGATAACAT T T CAT T G TAACTGGT GT TC 
TATACACAGAAAACAATT TATTT TTTAAATAAT TGTCTTT TTCCATAAAAAAGATTACTTTCCAT 
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
TGTATTTATTATTATTATAAGACTGCATT TTAT TTATATCATT T TATTAATATGGATTTATT TAT 
AGAAACATCAT TCGATAT TGCTACT TGAGTGTAAGGCTAATATTGATATTTATGACAATAAT TAT 
AGAGCTATAACATGTT TATT TGACCTCAATAAACAC TTGGATATCCC 
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FIGURE 154 

MAALQKS VS S FLMGTLAT S CLLLLALLVQGGAAAP I S S HCRLDKSNFQQP Y I TNRT FMLAKEAS L 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-10.1 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE iss 



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
| s i GCCCAACCGCCACCGAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
Q CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 

W TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 

f« 

r; GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 

2 S :: 

III AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 

g| GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 

4*1 ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 

* sa CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 

CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 

Hi 

i'4 GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 



GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 



TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
1 y TTGACATGGATGATTCTGAGGAGGAAGCTGTTATTGAATGTATAGAGATTTATCCAAATAAATAT 

CTTTATTTAAAAATGAAAAA 
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FIGURE 156 

MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAI S PWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 152 



CCGGCG ATG TCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGAC T TGAGGGACCTCCGAGTAGAACC TGT TACAAC TAGTG TT GCAACAGGGGAC TAT T CA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GAT T T GT G T GACGGGCAAAAGCAACT T CCAGTCCTACAGC T GT GT GAGG TGCAAT TACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAGTCTATTTCAT TGGGGCCCATAATATTCCTAAT GCAAATATGAATGAAGATGG 
CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 
GTAGAAGT GAAC T T CACAACCAC TCCC CT GGGAAACAGATACATGGC T CTTATCCAACACAGCAC 
TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAAGAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CAT GGGT GCT GGTGGCAGGGAT C TAT C TAAT GT GGAGGCACGAAAGGAT CAAGAAGACT TCC TT T 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
T CACACAAT T TGTTAC T T CACT GAAT TTCTT CAAAACCAT TGCAGAAG T GAGGTCAT CC TTGAAA 
AGT GGCAGAAAAAGAAAATAGCAGAGAT GGGTCCAGT GCAGTGGC TT GCCAC TCAAAAGAAGGCA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
G T GAT C T AAGAAG C CAGAT T CAT C T GC ACAAATACG T G G T GG T C T AC T T TAGAGAGAT T GATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTG TAG 
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FIGURE 158 



if" 



MSLVLLSLAM.CRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMWSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSVNFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQH 
STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWLVAGIYLMWRHERIKKTSFSTTTLLPPIECVLWYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWIATQKKAADKVVFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYWVYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 



111 Signal peptide: 

|l| amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 



N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP- dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 

AGCCACCAGCGCAACATGACAGT GAAGACCCTGCATGGCCCAGCCAT GGTCAAGTACT TGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACGTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
ACTCAGCT GAAGAAG 
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FIGURE 160 



MTVKTLHGPAMVKYLLLSILGIAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNI E S RS T S PWNYTVTWDPNRYPSEWQAQCRNLGC I NAQGKEDI SMNS VP IQQETLVV 
RRKHQGCSVS FQLEKVLVTVGCTCVTPVIHHVQ 



Signal sequence: 

amino acids 1-30 



I s * N-glycosylation site. 

amino acids 83-87 

CI 

m 

1^1 N-myristoylation sites. 

Ill amino acids 106-111, 136-141 

CO 
C3 

irl ■ 

O 

w 
a 
m 



App_ID=l 0063584 



Page 310 



FIGURE 161 



ACACT GGCCAAACAAAAACGAAAGCAC T CCGTGC TGGAAGTAGGAGGAGAG T CAGGACT CCCAGG 
ACAGAGAGT GCACAAACTACCCAGCACAGCCCCC TCCGCCCCCT C T GGAGGCTGAAGAGGGAT TC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCC T GAAGATGAGGAAAAGT T T GGAGGAGCAGC TGACT CAGGGGTGGAGGAGCC TAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTC.CAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
y« TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
jplj TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
t% GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
j*f ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
Cfl GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
|y CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
Ifl CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 

CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
# s: TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
3 AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
O CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
||«- GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
JtJ CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
M CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
Ifj CTGCGCTTTCCCTCATCCTCCTTCTCA^AAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
Q CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
pi CTCGG.GTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
sw CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 

CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCGGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
C G G G AC T TAAA T AAAGGC AG AC GCTGTTTTTC T AAAAAAA 
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FIGURE 162 
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MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQICETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVWDCFEAALGSEVRIWSYTQPRYEKEIJSrHT^^ 

EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKY IHKRWALVWLACLLFAAALSLI LLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLL Y SAD DS G FE RLVGALAS ALCQ L PL RVAVDL W S RRE L SAQG P VAW FH AQ RRQTLQ EGGVWLL FS P 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 
amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCG ATGA GGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 

GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 

AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 

AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 

AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 

GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 

CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 

CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 

TTCTACCACT TAGAGCTCCAGGTCAACCGCACCTACCAAATGCACC TTGGAGGGAAGCAGAGAGA 

ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 

CCTGGGCCAAGGAGAGTGCCCCCTACAT GT GCCGAGTGAAGACACT GCCAGACCGGACATGGACC 

TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 

CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 

TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 

AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 

TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 

CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 

GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 

CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 

CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 

TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACGAGC 

CCCT GGGGAT T TGCACAGACAGAACAT CT GACCCAAAT GT GCT ACACAGTGGGGAGGAAGGGACA 

CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 

CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 

TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 

GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 

AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 

ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 

AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 

CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 

CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 

ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 

GCCT GGAAAAGAACCAGGCC TGGAAAAGAACCAGAAGGAGGCT GGGCAGAACCAGAACAACC TGC 

ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 

TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 

GAAAT GCAGGTCCACCAGGGAGGGAGACACACAAGCC T T T TCTGCAGGCAGGAGT T TCAGACCCT 

ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 

TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 

CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 

ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 

TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 

AAAAAAAA 
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FIGURE 164 



MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRS I QMI VHPTP TP I RAGDGHRLTLED I FHDL FYHLELQVNRT YQMHLGGKQRE YE FFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 16* 

TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAACATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGT GT T GAACACAGTT T T GGATAT T TT CCAAAAGAT T TGATCAAGGT ACT T CATAAATACACGGA 
AGAAGAGCTACATATTCCAGCAGATGAGACAGACTTTGTCTGCTTTGAAGGAGGAAGAGATGATT 
T TAATAGT TATAAT GTAGAAGAGCT T TTAGGATC T T TGGAAC T GGAGGAC TCTGTACC TGAAGAG 
TCGAAGAAAGC TGAAGAAGT TTCTCAGCACAGAGAGAAATCTCCTGAGGAGTC TCGGGGGCGTGA 
ACT TGACCCT GTGCC TGAGCCC GAGGCAT TCAGAGC TGAT TCAGAGGAT GGAGAAGGT GCT T TC T 
CAGAGAGCACC GAGGGGC TGCAGGGACAGCCC TCAGCT CAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGAT GCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGT GGACAGTGCG T TAT T 
CAT TACAGCAAAGGATT TCGT TGGCATCAAAATCTAAGT TTGT TTTACAAAGATTGTT TT TAG TA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWLFVXGALWWVPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 



App_ID=l 0063584 



Page 316 



FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCG,GGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCGCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAZ\AACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
C AAG TAT G AACC AGAG AT T AAT C C AAC AG C C C C T G T AG AAAAG C C T TAT C T T AC AAAT C AAC C AG 
GAGACAC C CAT C AGAAT GT GG T T G T T AC T GAAGCAGGT AT AAT T C C CAAT C TAAT T TAT G T T GT T 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA 
CCAGAAAAGAAAGTGGCATGGAAGTATAATAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 

G GAT C T G T AT AAGGAAT GGC AT C AGAACAATAGC T T GGAAT GGC T T GAAAT C AC AAAGGAT C T GC 
AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAZ\AGGATGCACCCAA 
AC T T C AAAC T T CAAG CAAAT GAAAT GGACAATGCAGATAAAGT T G T TAT CAAC AC G T C G GGAGTA 
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAAC TTAT TACTGT TGTCAAC TGAAT T CACACACACACAAATATAGTACCATAGAAAAAGT T TGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AATAAGAAGCTATTTCATTAAGTGTGATATAAACCTCCTCAASlCATTTTACTTAGAGGCAAGGAT 
T GT C TAAT T T CAAT T G TGCAAGACAT G T GC C T TATAAT TAT T T T T AGC T T AAAAT TAAAC AGAT T 
TTGTAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATTTGAACAAAAGAAG 
T GACAT ACACAAT ATAAATC AT AT GT C T T CACACG T T G CC TAT ATAAT GAGAAGCAGC T C T C T GA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAA.CAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACATTTTAGGAAGGAAAGGAACTACGAAATCGTGTGAAAATGGGTTGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAZyyy\AAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCWMYHQPTANPGLGGPYLYQWNDDRCNMKHN.YICKYEPEINPTAPVEK 
PYLTNQPGDTHQNVWTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 

STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 8 6-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-269 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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